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ABSTRACT 

Examined were the effects of memory and inference 
questions on learning in 90 educable mentally retarded children (aged 
9 to 13 years) who were read a short story, each section of which was 
preceded or followed by a training question which cued relevant 
information. Results indicated that those questions which followed 
the presentation of sections of the story were more effective in 
increasing short term achievement than were those which preceded the 
sections of the story, that close temporal proximity between critic il 
information to be learned and the question led to greater learning, 
and that those subjects who received the same type of training 
question and criterion question performed better on the delayed 
relevant criterion tests (after 15 minutes) than did those who had a 
mixture of question types. No differences were found between groups 
on the incidental criterion tests.. Appendixes included the short 
story, the criterion tests, and the training questions. 

(Author/GW) 



O 

ERLC 



A ?* 

V 

i 

J 



i 



Center for Innovation in 
Teaching the Handicapped 

School of Education, Indiana University, Bloomington 



THE EFFECTS OF QUESTION TYPE AND POSITION 
ON FOUR TYPES OF LEARNING AMONO 
MENTALLY RETARDED CHILDREN 



STEPHEN B. HILLMAN 




i i*N 

o 

O 
" O 

* o 

roj 




TUB EFFECTS OF QUESTION TYPE AND POSITION 
ON FOUR TYPES OF LEARNING AMONG 
MENTALLY RETARDED CHILDREN! 

Stephen B. Hillman 

.July, 1972 

Final Report 17.3 



Center for Innovation in Teaching the Handicapped 



Indiana University 



U S DEPARTMENT OF HEALTH. 
EDUCATION & WELFARE 
OFFICE OF EOUCATION 
THIS OOCUMENT HAS BEEN REPRO 
OUCEO EXACTLY AS RECElVEO FROM 
THE PERSON OR ORGANIZATION ORIG 
INATING IT POINTS OF VIEW OR OPIN 
IONS STATEO 00 NOT NECESSARILY 
REPRESENT OFFICIAL OFFICE OF EOU 
CATION POSITION OR POLICY 



*This research was: supported by grant, #OEG 9-242178-4149-032 from the 
U. S. Office of Education, Bureau of Education for the Handicapped to 
the Center for Innovation in Teaching the Handicapped. Contractors 
undertaking such projects under Government sponsorship are encouraged to 
express freely their professional judgment in the conduct of the project. 
Points of view or opinions stated do not, therefore, necessarily repre- 
sent official Office of Education position or policy. 



2 



i 



I 

I 

t 

I 

i 

i 

i 

THE EFFECT OF QUESTION TYPE AND POSITION 
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Stephen B. Hillman 

Center for Innovation in Teaching the Handicapped 
Indiana University 

Abstract 

That questions play an important role in the daily institutional ac- 
tivities of teachers is supported by a considerable amount of descriptive 
research. However, the effects of these qxiestions on student learning 
has received little attention. The purpose of the present study was to 
examine the effects of questions on learning among mentally handicapped 
children. The four types of learning were identified as relevant remember 
ing, incidental remembering, relevant inferring, and incidental inferring. 

Educable mentally retarded (EMR) children were asked to listen to a 
short story which was broken down into 10 sections. Each section was 
either preceded or followed by a training question, the purpose of which 
was to cue the relevant information. There were two types of questions, 
remembering and inferring. Any given subject received only one type. 

After completion of the story there was a 15 minute rest period during 
which the children played with stick ^figures. This was followed by a 
20-item free-recall criterion test of two item types- -remembering and in- 
ferring, a given subject receiving only one type of item. This did not 
have to be of the same type as the training question, and indeed, in 
half the cases the training questions and criterion questions were of 
different types. 
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Results of this study indicated that those questions which followed 
the presentation of sections of the story were more effective in increas- • 
ing short-term achievement than were those which preceded the sections 
of the story; that close temporal proximity between critical information 
to be learned and the question lead to greater learning; and that those 
subjects who received the same type of training question and criterion 
question performed better on the delayed relevant criterion tests (after 
15 minutes) than did those who had a mixture of question types. No 
differences were found between groups on the incidental criterion tests. 

The results are discussed in terms of interference and memory 
theory and implications for the teaching of mentally handicapped chil- 
dren were identified. 
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PREFACE* 

One of the Center's continuing goals has been to study the effects 
of teacher interactive behavior on the learning of mentally handicapped 
pupils and to invent ways in which teachers may be better trained in an 
attempt to increase pupil achievement. 

The research study to be reported herein is part of a muv. i larger 
attempt to reach this goal by examining the effects of teacher cognitive 
demands on pupil learning. Considerable time and effort have gone into 
the development of the Individual Cognitive Demand Schedule (Lynch and 
Amos ,-10-7 lb) . This observation system systematically records the nature 
of the cognitive activity between teachers and students. The instrument 
has been used widely in descriptive studies of both normal and special 
classes. While considerable correlational evidence and theory exist to 
indicate the importance of teacher cognitive demands, little experimental 
data can be found to support the validity of statements concerning the j 
differential effects of different 



*The author wishes to thank Drs. William W. Lynch, Melvin I. Semmel, 
Richard L. Turner and Robert B. Cairns for their helpful suggestions through 
out the development and execution of this project. 
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Chapter I INTRODUCTION 
Statement of the Problem 

Children classified as being mentally retarded are character- 
ized by slow intellectual development, poor performance on intellectual 
tasks, andjjoorer ability in self-guided learning than their normal IQ 
counterparts. 

Different theories have been proposed to explain these observed 
inadequacies. Zigler (1966) focused on motivational factors. Others 
hypothesize deficits in attention (Semmel; 1965 ; Zeainan and House, 1963), 
short term memory (Ellis, 1963, 1970) and in organization of stimulus 
input (Spitz, 1966). • 

\ Denny (1964) hypothesized that th^. performance of mentally retarded 
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icit in incidpn 



ntal learning. The research 



children may be due to a defi 

\ . \ 

of House and Zeaman (1963) tends to support this view in demonstrating 

that Retarded children do\not. appear to attend to relevant cues, but 
that learning does progress^once attention is directed to the appropriate 



cues. Other research (Semmel and Williams, 1968) has shown mentally 
retarded children to be- inferior in both intentional and incidental learn- 
ing, but that retarded .chi ldren can achieve normal levels of incidental 
learning if given sufficient exposure to the stimulus material. Thus 
it appears that retarded children may require special attention and direc- 

X , 

tion in helping \hem to achieve certain knowledge and learning skills 
which come more easily to normal children. 

Several studies have shown that educable mentally retarded (EMR) 
children are capable of productive thinking (Cawley and Chase, 1967; 
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Prehm and Crosson, 1969; Rouse, 1965; Smith, 1967; Tisdall, 1962). Vet 
other research (Brophy and Good, 1970; Deutch, 1966; Lynch and Ames, 1971) 

i' 

indicates that low-ability children and children for whom teachers have 
low expectations, miss out on opportunities for intellectual stimulation 
from the teacher by having fewer questions directed at them and that 
teachers frequently call : on students they expect to give the right answer. 

A 

t 

Still, it remains to be demonstrated that questions are capable of being 

\ 

effective stimuli in increasing the achievement of retarded children. 



It was the attempt of the research to be reported here to study the effects 

' R 

of different types of questions on both intentional . and incidental learn- 

* >< 

ing for the purpose of increasing the achievement of educable mentally 
retarded children. 



\ . 



Review of Research 

The research pertinent to the hypotheses of this investigation 
are presented in three separate sections. The first concerns itself 

with studies of classroom questions, the second section with studies 

* 

using questions to produce mathemagenic behaviors, and the third section 
with characteristics of educable mentally retarded children germane to 
this study. 

Cla ss room Questions 

That questions play an important role in the daily instructional 
activities of teachers is supported by at least 60 years of research. 



Stevens (1912) estimated that 80% of the time spent in school was occu- 
pied by question-and-answer recitations. Floyd (1960) found an average 
of 384 questions being asked by primary-grade teachers in a typical day; 
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Moyer (1966) found an average number of 180 questions per elementary sci- 
ence lesson, and Schrieber (1967) found that fifth grade teachers asked 
an average of 128 questions per social studies lesson. A recent study 
(Lynch 8 Ames, 1971a) demonstrated that there were no significant differ- 
ences with respect to the number of questions asked per hour of instruc- 
tional time between normal and special education classes. 

Descriptive studies concerned with the type of question asked in 
verbal discourse have demonstrated that approximately two-thirds of the 
questions asked require direct recall of textbook information. Stevens 
(1912) was the first to describe this phenomenon. More recently Floyd 
(1960), Guszak (1967) and Schrieber (1967) have demonstrated similar 
percentages in elementary school situations; Gallagher (1965) and Davis 
and Tinsley (1967) have observed similar percentages in high schools. 

In fact these and other studies prompted Gall (1970) to conclude that 
teachers • questioning practices appear to be unchanged over this lengthy 
period. 

Results of a recent study on questions in normal and special edu- 
cation classrooms (Lynch 8 Ames, 1971a) showed that teachers in each type 
of class seem to demonstrate similar percentages of higher and lower 
questions asked during instruction. TVo other studies (Fine, Alien, 8 
Medvene, 1968; Minskoff, 1967) show that teachers of elementary mentally 
retarded qhildren use the greatest percentage of factual questions with 
teachers of normal elementary children using fewer and teachers of gifted 

high school children the fewest. 

, * 

\ ’ 

\ 
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While educators have been concerned with the type and frequency 
of questions asked in classrooms for a long time, and more recently 
with questions in special education classes, little research has been 
done on the effects of different types of questions. The few studies 
which have examined the effects of questions are reported here. The 
subjects for these studies were all normal children or college students. 
Thd author was unable to locate any studies using educable mentally re- 
tarded children as subjects. 

A study done by Hilda Taba (1966) sought to clarify the effects 
of teaching strategies on the cognitive functioning of elementary school 
children. Several findings from this study are of importance here. 

First, in both experimental and control groups, teachers Were^relatively 
successful in getting students to give the response they sought (a'finding 
later supported in a study by Lynch and Ames, 1972). Second, although 
not consistent among all experimental classes, evidence from tests devel- 
oped for this study showed that in ability to discriminate, to infer 
from data, and to apply known principles to new problems, the students 
of teachers who had been trained in the skills of the three cognitive 
tasks were superior to those in classes with untrained teachers. And 
thirdly, the results showed that- the use of specific teaching strategics 
designed to foster development of cognitive skills seemed to "make a dif- 
ference in the general productivity of thought in terms of both higher 

t 

levels and complexity of thought. "The most important observation that 
can be made from the data," Taba states, "is the centrality and power 
of the teacher's role in initiating cognitive operations and determining 




14 



* 



f 



I 5 

which kinds are open to students" (1966, p. 228). The fact that students 
generally gave what teachers sought indicates the power that a question- 
ing strategy has in determining those cognitive operations in which stu- 
dents engage. 

In examining the relationship between the variables of question 
type and. student achievement, Hunkins (1967, 1368) had one group of sixth 
grade subjects work exclusively with knowledge- level questions, while a 
second group worked with analysis-evaluation questions. Results showed 
that the analysis-evaluation group earned a significantly higher score on 
a specially-constructed posttest than did students who answered questions 

that stressed knowledge. While Hunkins’ findings are of interest, they 

' N . 

must be viewed with caution because 'of serious methodological considerations. 

Questions may also be raised about the definitions of analysis and evaluation 

» \ 

used in this study. ' 

Wright and Nuthall (1970) reported a study in which they explored 

\ 

the relationship between teacher behaviors and pupil achievement. Teacher 
behavior variables were identified from tape recordings and correlated 
with achievement test scores (developed on the basis of the lesson con- 
tent outline especially for this study) which had been corrected for pupil 
intelligence and prior knowledge. Among other findings, Wright and Nuthall 
reported that the mean class achievement scores correlated significantly 

\ 

with patterns and kinds of teacher questioning. Some teachers tended to 
ask one question at a time, while others frequently asked two or more in 
rapid succession in a single utterance. The data showed (a) that the 
tendency to ask one question at a time was positively related to achieve- 
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ment, (b) the tendency to ask several questions was negatively correlated 
with achievement, and (c) the greater the percentage of a teacher's questions 
which were closed (i.e., required single statements of fact, description, 
definition, naming) as opposed to open (i.e., required statements of opinion, 
evaluation, explanation, inference), the higher the achievement of the 
pupils. 

A fifth study having pupil cognitive learning as a dependent vari- 
able was conducted by Ladd and Anderson (1970) in which they investigated 
the effects of the level of inquiry of teachers' questions on the achieve- 
ment of 1000 ninth-grade earth science students in 40 classes. A median 
split was used in separating the 40 participating teachers into equal size 

l 

groups of low- and high-inquiry teachers based upon observations of their 
teaching behavior. Results of this study, with adjustments. made for in- 
telligence, show that the students of high-inquiry teachers performed 
significantly better on tests which contained (a) low inquiry questions 
only, (b) high inquiry questions, and (c) both high and low inquiry ques- 
tions. The between-group differences were significant beyond the .001 
level. The authors thus concluded that "teachers' questioning behavior 
strongly influences student achievement." (p. 398) 

Two investigations (Furst, 1967; Thompson & Bowers, 1968), study- 
ing the cognitive level of classroom discourse, found a positive relation- 
ship between the amount of cognitive variation employed by the teacher 
during classroom discourse and student criterion performance. 

In a study by Chall and Feldman (1966) teachers whom observers 
rated as emphasizing the stimulation of thought and skills tended to have 
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higher-achieving classes on reading subtest scores such as word reading, 
paragraph meaning, spelling and word study than their information and skill 
counterparts. Data from this study also indicated a positive relation- 
ship between student achievement and a rating of whether the cognitive level 
of the teacher's lesson appeared to be "just right most of the time." 

Several studies have been conducted in which questions were classi- 
fied into two types. The results of seven of these investigations are 
reported here. Of these seven, significant results were not obtained in 

three. In t' of these studies (Harris and Serwer, 1966; Harris et al., 

\ 

1968) attempts were made to correlate the OScAR-R subscales with reading 
achievement among disadvantaged children. In the third study Perkins (1965) 
attempted to correlate teacher behavior with lack of achievement among 
high ability students. A fourth study by Wright and Nuthall (1970) found 
open and closed questions unrelated to achievement when the frequencies 
were converted to percentages, a significant relationship was obtained 
with the closed questions, showing a positive relationship to pupil achieve- 
ment. Of the three remaining studies, all reported significant results. 
Kleinman (1964) reported that teachers with high-achieving junior high 
school students asked more "high-level" questions during their science 
lessons; Spaulding (1965) found "open-ended" questions to have a negative 
correlation with achievement among elementary students; and Thompson and 
Bowers (1968) reported that high achieving students had teachers who asked 
admixture of convergent and divergent questions. 

i / 

Rosensnine and Furst (1971) report that tKey had located two studies 
which used multiple classifications of teacher questions. Both of these 
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studies (Conners 8 Eisenberg, 1966; Solomon, Be zdek, 6 Rosenberg, 1963) 
had significant results. The Conners and Eisenberg ( 1966)'Study "in partic- 
ular demonstrated an important and strong relationship between intellectual 
activities of the teacher and increased pupil scores on the Peabody Picture 
Vocabulary Test (PPVT) . 

The general conclusion to be drawn from these studies seems to be 
that there is at least a correlational relationship between teacher ques- 
tioning behaviors and student achievement in normal children. Exactly 
what that relationship is remains unclear. The correlational evidence 
has indicated several profitable variables for more carefully controlled 
experimental studies. Some of these variables which appear to have a 
positive correlation with achievement are: (a) the tendency to ask one 

question at a time as opposed to several at once (Wright 6 Nuthall, 1970); 
(b) the amount of cognitive variation employed by the teacher (Furst, 

1967; Thompson 5 Bowers, 1968); (c) the opportunity the students have to 
learn at the cognitive level appropriate to the criterion achievement test 
(Chall 5 Feldman, 1966; Hunkvns, 1967, 1968; Ladd 6 Anderson, 1970; Taba, 
1966); and (d) the tendency to ask "closed" as opposed to "open" questions 
(Spaulding, 1965; Wright 6 Nuthall, 1970). 

In summary , most of the research on the effects of teacher questions 
has been of a correlational nature. The research has produced a number of 

interesting correlates with student achievement, but the validity of these 

i 

correlates needs to be further explored in experimental studies. 

Questions and Mathemagenic Behaviors 

In recent years a large number of studies have been conducted with 
respect to cognitive learning from written materials. E. Z. Rothkopf coined 

18 
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the term "mathemagenic" to refer to "those student, activities that are 
relevant to the achievement of specified instructional objectives in speci- 
fied situations or places" (Rothkopf, 1970). Rothkopf assumes that these 
student activities are modifiable, that if improperly controlled or uncon- 

| 

trolled may lead to irrelevant learning and that the learner adapts his 
activities to the requirements of training questions or orienting tasks 
(1970). In attempting to gather empirical evidence in support of these as- 
sumptions, Rothkopf and others have focused mainly on the influence of 
training questions on learning from written materials. The following 
selected studies will provide the reader with a general review of the nature 
of this work. 

Rothkopf (1966) investigated the effects of adjunct, test-like 
questions on learning from written materials. In this study he asked college 
students to read a 5,200 word selection from Rachel Carson's The Sea Around 
U£ which was broken down into seven sections of approximately equal length. 

As criteria he used a test composed of 14 questions intended to measure 

r 

specific learning resulting from experimental questions asked in the text 
and a test of 25 items not used as experimental questions, called a general 
test. There were six treatment groups: (1) "SBA" - subjects were given 

two questions shortly before each of the seven sections. After writing 
his guess, the subject was given the correct answer; (2) "SB" - same treat- 
ment as SBA except that the subject was not provided with the correct 
answer after he made his guess; (3) "LBA" - subjects were given all 14 
questions at once just before starting to read the chapter. They were 
given the correct answer after they made a guess on each question; (4) "SAA" - 
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; the subjects were given two questions immediately after each of the seven 

sections. The correct answer was provided as soon as the subject responded 
to each question; (5) "SA" - same treatment as SAA except that the correct 
answers were not provided after the subject gave his answer; (6) Control - 
no questions were given in the text. A direction reference group (QRG) 

, was added to evaluate the effects of questions compared to care-inducing 

directions . 

The results of the study indicated that the SA, SAA, and DRG groups 
' performed better on the 25-item general test than all other groups. Ail 

groups which received questions performed significantly better on the 14 
item specific learning test than those groups which did not. Those groups 
receiving answers after responding did better on the 14 item specific test 

than did those with no feedback. It was evident from these results that 

\ 

(1) asking questions after reading the relevant test passages facilitated 
both specific and general learning, (2) questions presented before the 
relevant text passage produced only question-specific facilitative effects, 
and (3) that question-specific-effects were most noticeable when the correct 
answer had been given to the subjects after they responded. Rothkopf con- 
cluded from this that adjunct questions, unlike specific directions, may 
shape effective inspection behavior and are also useful in teaching specific 
skills. 

A study by Rothkopf and Bisbicos (1967) hypothesized selective 
facilitative effects of interspersed questions on the learning of written 
materials. The 252 high school subjects were asked to read a 36-page, 
9000-word passage about animals and minerals found in the sea. Two questions 
appeared in the text per each three-page zone, but the questions differed 

20 
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in location (before or after the relevant segment) and in required response. 
Different treatment conditions saw questions restricted to one of the fol- 
lowing response types (a) either a quantitative term or name, (b) a common 
English or a technical word, (c) a mixture of (a) and (b) . Each treatment, 
group responded to the same 48-item criterion test after having read the 
passage. Results indicated that learning of the several categories of text 
content was facilitated by appropriate questions seen immediately after 
exposure to the relevant text segment as opposed to those seen before. 

In a somewhat related study using a similar experimental design, 
Frase (1968b) was able to replicate the finding that retention was highest 
when questions were placed after the appropriate material. The 128 college 
students in this experiment were asked to.-read a 2000-word passage con- 
cerning the life of William James. Instead of questions being placed at 
two or three page intervals, however, questions were paced at the rate of 
one every 10, 20, 40, or SO sentences. The data indicated that retention 
increased with the frequency of posttreatment questions, but it decreased 
with frequent pretreatment questions. Question mode (multiple choice or 
constructed response) in terms of questions appearing in the text was also 
a variable, but had no effect. 

The problem explored in the next study (Frase, 1968a) was to de- 
termine what happens to the retention of information contained in a passage 
when an orienting question is asked which requires the processing of a 
relatively large or small amount of the total information contained in that 
particular passage. Eighty-four college subjects were allowed 20 seconds 
to read a question and a 36-word paragraph. While the paragraph was the 
same for all subjects, the questions differed. One group of subjects read 
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a specific question, another group a comparative question, and the third 
group a general question. Each question was read by each experimental group 
before reading the paragraph. The results of the study showed that (a) the 
most precise question (i.e., specific > comparative > general) led to the 
most efficient acquisition of the specific stimulus-response association 
(i.e., more subjects in the specific question group passed the test item 
which was relevant to their question), and (b) when performance on the 
total retention test was the criterion, the groups scored in the same order 
(i.e., specific > comparative > general). While result (a) is consistent 
with experimenter's hypothesis, result (b) is just the opposite of what was 

y 

predicted. It was reasoned that general orienting questions would require 
the/subjects to process greater amounts of information and thus their gen- 
eral retention would be higher.- Though the results did not support this 
positon, they did supply evidence for the selective information rejection 
(attention position suggested by Berlyne (1965) and Schroder, Driver, and 
Streufert (1967) in which it is hypothesized that the greater uncertainty 
created by the comparative and general questions forced the subjects to 
engage in information rejection strategies in order to reduce the information 
load of the paragraph. Data derived in another project as part of this 
same study indeed add support to this position. To quote Frase, 'The gen- 
eral conclusion seems to be that as effective uncertainity or information 
load increases, precise control over reading behavior becomes more imper- 
ative" (p. 201). 

Another series of studies conducted by Frase (1969) and reported 
in monograph form induced subjects to think about text material by having 
them deduce conclusions from that material. The conceptual characteristics 
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of the text material were analyzed in order to permit predictions about which 
text items would enter memory as a function of different orienting directions. 
The ability to control learning from text material, it was stated, hinges 
upon an adequate understanding of this interactive process. It was clear 
from the results of the first experiment in this monograph that when a 
certain text item was a component of a problem solution, it had access to 
memory and was higher in recall^than if it was not part of the problem solu- 
tion. The basic hypothesis here is that, while subjects might scan an entire 

« 

passage for the information necessary to draw a certain inference as commun- 

\ 

\ 

icated by an orienting direction or question, the text which is not relevant 

to that conclusion will receive only minimal processing and not have access 

\ 

to\memory. This finding held for all three of the experiments in the mono- 
graph. Specifically though, experiment #1 showed that the recall of text 
items, which mediated problem solution was greater than for those text points 



which did not mediate problem solution; and experiments #2 and #3 demon- 
stratedythat inducing higher levels of information processing adds new 
items to memory and thus raises the over-all level of recall, but does 
not increase the number of correct inferences. 

A final study in this section concerns itself with the effects of 
written versus orally-communicated questions on learning from written materials 
(Rothkopf 8 Bloom, 1970). Sixty-three high school students studied a 16,000- 
word earth science text which was presented to them individually on 180 slides. 
In one experimental group, a written question related to the previous reading 
appeared after every sixth slide. The subject then wrote down his response 
on a piece of paper, but received no feedback. The second experimental group 
received an oral question asked by a teacher after every sixth slide. The 
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subject gave, his response, but received no feedback. The control group 

received no questions. The results indicated that the oral question group 

• • • ' ^ 
scored significantly higher on a recall criterion test than did the written 

question group, and that both groups scored significanlty higher than the 

control. 

In summarizing briefly, it seems clear that different questions can 

in fact produce different learning outcomes (c.g., Frase, 1968a, 1968b; 

\ 

Rothkopf and Bisbicos, 1967). It also seems clear that when learning from 
text material, questioning can produce more effective learning than no 
questioning (Rothkopf, 1966; Rothkopf f, Bisbicos, 1967). Questions which 
are asked after a subject reads a particular portion of the text as opposed 
to questions whid) are asked in advance of his reading also produce better 
learning (Frase, 1967, 1968a, b; Rothkopf, 1966; Rothkopf 8 Bisbicos, 1967). 
The most interesting result of the studies on adjunct questions, according 
to Rothkopf (1970), is that they demonstrate that mathemagenic activities 

are adaptive and that ’’the shaping of mathemagenic activities in an instruc- 

. " ■ i 

tional fashion by environmental events (or contingencies^ is a practical 

possibility (p. 333)." 

Unfortunately all of these studies have been done on college or 
high school students and it remains to be seen whether or not the behaviors 
of mentally handicapped children are modifiable under similar stimulus 
control and during a variety of learning tasks. Because of the consistency 
of the reported pre-post question results in favor of the post question in 
producing greater learning, thi3 would seem to be an important variable 
in testing the adaptability of the mentally handicapped to different question 



stimuli. 



Relevant Characteristics of Educable Mentally Retarded Children 

Mentally retarded children tend to be slow to develop intellectually 
(Robinson 6 Robinson, 1965) and to do poorly on intellectual tasks in gen- 
eral (Denny, 1964). Several attempts to explain this have postulated 
various types of deficits characteristic of the educable mentally retarded 
(EMR) child (e.g., deficits in attention to appropriate cues (Semmel, 1968; 
Zeaman and House, 1963); short-term memory (Ellis, 1963, 1970);, organiza- 
tional strategies (Spitz, 1966); and in the development of learning sets 
Kaufman and Prehm, 1966) . The following represents a brief review of theory 
and research dealing with each of these orientations. 

Zeaman and House (1963) have argued that the observed learning 
deficit of retarded children may be accounted for by a lack of attention. 
Stated more precisely, they theorize that (1) attention is limited to 
only one (or at most, a few) of the many possible stimulus 'dimensions avail- 
able to the subject at any particular moment, (2) subjects may learn to 

{ 

attend to or disregard stimulus aspects as a function of differential rein- 
forcement, and (3) cues for instrumental learning are those aspects of the 
stimulus which are being attended to (p. 212). Zeaman and 
on to state that 

If our analysis of retardation and attention is 
correct, the secret of successful training of 
moderately retarded children lies in the engin- 
eering of their attention. . . . one should seek 
ways of increasing the attention value of the . 

relevant cues. (p. 218) 

Drawing on the work of Zeaman and House (1963) , Denny (1964) hypoth- 
esized that the performance of mentally retarded children may be due to a 
deficit in incidental learning, and contended that the basic attentional 
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problem hypothesized by Zeaman and House (1963) may result in poor inci- 
dental learning. Several experimenters have reported results consistent 
with this hypothesis (e.g., Goldstein and Kass, 1961; Semmel and Williams, 
1968). Thus it appears that retarded children may require special direction 
and additional help in school learning situations. 

Research on short-term (STM) and long-term memory (LTM) involves 
considerable definitional problems. Distinctions between these two con- 
cepts are often vague and arbitrary. Nonetheless, several authors have 
proposed theoretical distinctions between STM and LTM ( Ellis , 1963, 1970; 
llebb, 1949; Peterson, 1966). The work of N. R. Ellis is discussed here 

S 

because it deals focally with the issue of memory and the retarded child. 

In 1963 Ellis hypothesized that the inadequate behavior of retardates 
was in part due to a stimulus trace diminished in both strength and duration. 
Further, he predicted that (1) when the performance of retardates and normals 
wero compared on tasks which required the bridging of a time gap, that 
the retardate's behavior would suffer in comparison; (2) as the magnitude 
of the temporal -separation between events increased, the retardate's per- 
formance would deteriorate even mere; and (3) that the stimulus trace would 
show a developmental trend with both strength and duration increasing as 
a function of age and intelligence. In theorizing about the role of memory 
in learning, Ellis predicted poorer performance by retardates as compared 
with normals on a wide range of tasks such as reaction time, delayed re- 
sponse, paired-associate learning as well as simple retention. 

Results of experiments designed to test the above theory led Ellis 
(1970) to hypothesize a multi -process conception for retention of supraspan 
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information. This reformulation postulated the presence of two memory 
processes -- primary memory (PM) and secondary memory (SM) . Research 
(Ellis f* Hope, 1968) had shown the recency and primacy segments of the 
serial position curve to be discontinuous processes. Primacy could be 
influenced by rate of presentation, or by delay prior to recall. It was 
hypothesized that rehearsal strategies may -be responsible for facilitating 
primacy performance. Rehearsal appeared to have no effect upon recency. 

Research results (Ellis, 1970) suggest that primary memory (PM) 
in the retardate and normal may not differ. But, while this recency per^— — 
formance is comparable in both retarded and normal subjects, adult and 
adolescent retardates display a lower ^primacy performance than normal sub- 
jects (Ellis, 1969). As is the case with normal subjects, simple rehearsal 
or labeling facilitates recency and depresses primacy effects in a probe- 
type memory task (Ellis, 1969). From these findings Ellis (1970) concluded 
that while normal subjects showed evidence of using both primary and secondary 
memory, in retardates the secondary memory process fails to function in 
the normal fashion. Ellis further concluded that active rehearsal strategies 
appeared necessary for secondary memory but not for primary memory and that 
the retardate's deficiency may be due to a failure of the rehearsal mech- 
anism^). Belmont and Butterfield (1969) argue in a similar fashion by 

suggesting that the short-term memory deficit of retardates results from 

/ 

and acquis i ton deficit which is probably due to a failure to actively re- 
hearse stimulus input after it enters primary memory rather than due to 
defective retention or retrieval. It seems plausible, Sitko and Semmel 
(1971^argue, to hypothesize a deficit in secondary memory which is deter- 



mined by "secondary organization rather than a deficit in primary or short- 



term memory. 
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Several Investigators have argued that the need for individuals 
to store a vast amount of information in a limited storage capacity requires 
the individual to impose some form' of information reduction on the stimulus 
input (Bruner, Goodnow and Austin, 19S6; Bruner, Oliver and Greenfield, 

1966; Handler , 1967; Miller, 1956). These views which characterize the 
child as an "active" learner deal primarily with the ability of an individual 
in selecting, processing, storing, and retrieving relevant information. 

Human memory, for these theorists, is essentially an active process of the 
organization of stimulus input by the learner. 

Spitz (1966) has suggested that the slower learning performance 
of mentally retarded children on cognitive and memory tasks may be due 
to the faulty or inefficient organization of stimulus material to be learned. 
Spitz maintains, however, that "the question is not whether or not retardates 
group or organize materials, but rather under what conditions, in what manner, 
and how efficiently they display this capacity" (p. 36). 

Semmel (1967) in examining differences in cognitive organizational 
strategies used by EMR and nonretarded children suggests that two funda- 
mental and qualitatively different strategies are involved. The first is 
called the sequential-associative strategy and the second is called the 
hierarchical strategy. EMR children, according to Semmel (1967), tend to 
use only the more primative sequential-associative strategy when processing 
language, while both the hierarchical and sequential-associative strategies 
are typically used by normal children. The sequential-associative strategy 
results in simple stimulus-response associations or sequentially dependent 
chains, while hierarchical processing results in the formation of a struc- 
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tured network of concepts, classes, systems and relationships and involves 
the abstraction of common attributes to form a generalized internal repre- 
sentation. Support for Senunel's positon has been demonstrated in several 
studies (e.g., Semmel and Bennett, 1970; Semmel, Barritt, and Bennett, 1970; 
Semmel , Barritt, Bennett, and Perfetti, 1968; Sitko, 1970). 

One of the clear advantages of the learning set approach to the 
study of fundamental learning processes is that one is able to systematically 
record an index of learning efficiency, or in some cases skill development, 
over a series of problems or trials. The major credit for the development 
of learning set evaluation techniques belongs to Harlow for repeatedly 
demonstrating that organisms can learn a series of discrete discrimination 
problems with pregressively greater efficiency (1959). 

, Learning set research with mentally retarded subjects has produced 
the following results: (1) mildly retarded children tend to produce signif- 

icantly more stimulus preservation errors than normal children of the same 
chronological age (Kaufman and Peterson, 1958, 1965); and (2) a relatively 
gross relationship exists between mental age or IQ and learning set form- 
ation (Ellis, 1958; Harter, 1965, 1967; Kaufman and Prehm, 1966; Stevenson 
and Swartz, 1958) with normal children doing better than the mentally re- 
tarded subjects. •, 

/ ^ 

! Though the above research tends to show that mentally retarded 

/ 

children exhibit difficulty in acquiring learning sets, they also demon- 
strate that learning sets may be acquired by those same subjects. Indeed, 

some studies show that with appropriate procedures (e.g., overlearning of 

! 

, i 

the original task) learning sets may be retained for as much as a year 
(Kaufman, 1971). 
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Objectives of the Present Study 

School situations place certain demands upon the learner. One of 
the most obvious is that what is learned in school is transmitted verbally 
in classrooms. In the case of normal children much of this is through writ- 
ten materials, especially as the child gets older. But because of poor 
reading skills, the retarded child continues to learn predominantly through 
the processing of verbal information which is transmitted aurally. In view 
of the need which mentally retarded children have for more directed and 
structured learning activities, it is important that means be found to 
supply this structure and thus to facilitate their cognitive learning 
during listening tasks. Questions have been found to play a large role ^ 
in the daily instructional activity of the classroom teacher, and indeed, 
some evidence suggests that questions may have positive effects on the 
learning of normal children. As of yet questions have not been demonstrated 
to have a significant influence on the behavior of mentally retarded chil- 
dren, though the cuing and structuring effects of questions offer consider- 
able theoretical, as well as practical, appeal. 

It was the intent of this study to demonstrate that questions have 
important theoretical and practical effects in enhancing the cognitive learn- 
ing of educable mentally retarded children. This study attempted to answer 
the following general questions: 

(1) Can questions be used effectively to facilitate the short 
term retention of information? 

x. 

(2) Can questions be used to facilitate the longer term retention 
of information (after 15 minute delay)? 
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(3) Does the manipulation of question position relative to con- 
tent material influence short-term retention? long-term 
retention (after 15 minutes)? 

(4) Can questions be used to effectively influence information 
processing strategies? 

(5) Can questions be effective in creating learning sets in 
mentally retarded children? 

Description of Learning Task 

The subjects in this study were asked to listen to a short mystery 
story of approximately 2400 words. The story was broken down into ten 
sections. In two of the experimental conditions subjects were asked one 
questions before each of the ten sections. In the other two experimental 
conditions subjects were each asked one question after hearing one of the 
ten sections. In all cases the subjects responded to the question after 
hearing the section pertaining to that question, but received no feedback 
to their responses. All subjects heard all ten sections and answered ten 
questions in total. A control group heard the story but received no ques- 
tions during the listening task. Half of the subjects received remember- 
ing questions throughout the listening task. The remaining half received 

# 

inferring questions. 

A 20-item criterion test was administered after a 15 minute delay. 
Ten of the items were the same questions the subject had been asked during 
the story. These are called relevant items. The remaining 10 items the 
subject had not been asked yet and these are called incidental items. 

See Table 1 for a pictorial display of the this design. 
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Table 1. Experimental Design 

* uncircled number is the subject number 
© - relevant and incidental remembering criterion test 
(2) ■ relevant and incidental inferring criterion test 
E » experimenter 
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Definitions 

1. RELEVANT QUESTIONS are defined as those questions which were asked by 

the experimenter during the listening activity. 

2. INCIDENTAL QUESTIONS are those items which relate to the material 

covered during the listening activity (i.e., information contained 
in the story), but which had not been asked before. 

3. REMEMBERING QUESTIONS are defined as those questions which require the 

child to recall something. It may be a straight forward summary 
or a piece of information which he is required to remember. 

4. INFERRING QUESTIONS call upon the child to make interpretations or to 

draw conclusions from data. The child must arrive at his own 
interpretation, deduction, or conclusion from available information. 

5. PRE-QUESTIONS are those questions which were asked in advance of the 

child's listening to a section of the story. 

6. POST-QUESTIONS are those questions which were asked after the child 

had heard a section of the story. 

Hypotheses 

As a result of the findings of previous studies cited in the review 
of the literature, the following hypotheses were tested in this investi- 
gation: 

On the analysis of between-trials scores (short-term retention) 

1 - Children receiving pre-questions would perform better than children 

receiving post-questions, 

2 - Children would score higher on trials where information necessary 

to answer the questions were in "close proximity" as opposed to 
"distant proximity" from the question. 
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3 - Children would perform better on trials where they received 

V 

a pre-question and the information was close as compared with 
when they received a post-question and the information was close. 

4 - Children who receive post-questions would show improvement over 

trials on between trials scores. 

5 - Children who receive pre-questions would show no improvement 

over trials on between trials scores. 

On analysis of delayed (after 15 minutes) criterion measures 

6 - Children who received questions would score higher on the relevant 

criterion test which asked the same questions as were in the 
training than children who were in the control group and did not 
receive training questions. 

7 - Children who received questions would perform better on the 

incidental criterion test than children who were in the control 
group and did not receive training questions. 

8 - Children who receive pre-questions would perform better on the 

relevant criterion test than children who received post-questions. 

9 - Children would porform better on the relevant subtest than on the 

incidental subtest. 

10 - Children who received remembering training questions would score 

higher on the remembering relevant criterion test than children 
• who had received inferring training questions. 

11 - Children who received inferring training questions would perform 

better on the inferring relevant criterion test than children who 
had received remembering training questions. 
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12 - Children who received remembering training questions would 

score higher on the remembering incidental criterion test than 
children who had received inferring training questions. 

13 - Children who received inferring training questions would per- 

form better on the inferring incidental criterion test than 
children who had received remembering training questions. 

14 - Total score criterion test results would be highest for children 

receiving the same type of criterion test as training question. 

15 - Children who received post-training questions would perform 

better than children who received pre-training questions on 
the incidental criterion test where the items in the test were 
of the same type as the training questions. 
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CHAPTER II 

METHOD AND PROCEDURES 

The preceding chapter described the review of related literature 
and listed the objectives and specific hypotheses of the present investi- 
gation. This chapter describes the design of the study, subjects, materials, 
experimenters, procedures and analysis of the data. 

Design of the Study 

The study was designed as a 3x2x2x2x6 repeated measures design 
with the repeated measure being location of information (i.e., answer) 
within the section of the story. Question position was the first factor 
with three levels: pre-question, post-question, or no question (control 

group). The second factor was question type: remembering or inferring 

questions. The third factor, location of answer in the section, was the 
repeated measure and had two levels: first half of the section or second 

half of the section. Criterion type was the fourth factor and consisted 
of two levels: remembering or inferring questions. The fifth factor was 

experimenter and there were six levels. 

There were four dependent variables: total between- trials score, 

relevant score after fifteen minute delay, incidental score after fifteen 
minute delay, and total score after fifteen minute delay. 

A total of 8 experimental groups and 2 control groups were involved 
in this study. Each group had 9 subjects. Subjects in the experimental 
groups received questions either before or after listening .to a section 
of the story. Each of the 10 sections of the story was accompanied by a 
different question but of the same type. All of the questions for any 
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particular subject were either pre-questions (before lie heard a particular 
section of the story) or post-questions (subject heard the question after 
he heard the appropriate section of the story) . In half of the sections 
of the story the answer to the question:, was contained in the first half 
of the section. The second half of the section contained the information 
necessary to answer the question in the remainder of the sections. Within 
these limitations, the position of the answer in a particular section was 
assigned at random. The subjects in the control groups received no questions 
between sections of the listening task. In place of the question there was 
a period of twenty seconds of silence. Then the story continued. 

After the fifteen minute delay-rest period where the subjects played 
with the Cuisenaire Rods, each subject was given a twenty item criterion 
test. This test could be of two types: remembering or inferring. Each 

subject received only one type of criterion test. Half of the subjects 
who received remembering questions during the listening activity received 
the remembering criterion test; the other half received an inferring cri- 
terion test. Similarly, half of the subjects who received inferring ques- 
tions during the listening activity received the inferring criterion test; 
the other half received the remembering criterion test. One of the control 
groups received the remembering criterion test; the other the inferring. 

See Table 1 (p. 22) for a pictorial display of the above description. 

Subjects 

The subjects (N = 90) were mentally retarded children who were 
students in intermediate special education classes in Indianapolis, Indiana. 
They ranged in chronological age from 114 months to 168 months (X » 141.60, 
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S.D. * 10.90) with mental age ranging from 74.48 months to 132.88 months 

(X = 98.42, S.D. » 13.10). The sample included 54 males and 36 females. 

The schools from which these children were selected were chosen in such a 

way as to give as representative a sample of EMR children as the urban 

population from which they were drawn would allow. As such, schools 

were selected in order to give as complete a balance between race (i.e., 

1 

black and white) and socio-economic status (i.e., lower class and middle 
class) as possible. A school was determined to be predominately black or 
white depending upon which race made up more than 50% of the population 
of that school and the socio-economic status of the school was determined 
by the Director of Research for the Indianapolis Public Schools. The 
children were sampled from a total of 11 different schools. 

Subjects were randomly selected from special education classes 
within each of the 11 schools. A child was included in the study if he 
or she was one of those randomly selected and was not eliminated by that 
child's teacher because of either hearing problems, speech problems or 
severe behavior problems. Children were then randomly assigned to treat- 
ment conditions. A subsequent t-test of means confirmed that mental and 
chronological agq were randomly distributed among treatment and control 
groups . \ 

Materials 

The Story 

Listening to a short mystery story was selected as the learning 
task in this study. This was done for two reasons. First, was the import- 
ance given to learning about the effects of questions on the acquisition 
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of verbal information and the particularly great importance of learning 
• through listening in real school situations. The second reason was that a 
short story would be more likely to keep the child on task while at the 
same time it could be broken down into 10 separate sections for the purposes 
of experimentation and analysis, with each section being treated as a 
separate unit. 

The particular story the children were asked to listen to was a 
high interest- low vocabulary (vocabulary fourth grade) short mystery story 
»* •• of approximately 2400 words. It was broken down into ten distinct sections 

of approximately equal length. The story is about a boy named Leroy who 
finds a map which he believes marks the spot where bank robbers buried 
some stolen money. The spot, he fiyids out later, is in his backyard, but 
does not contain stolen money. Rather, it is the spot where some men did 
some digging in order to find out if there was oil under the ground there. 
The complete story may be found in Appendix A. 

* 

Each of the ten sections was re-written in such a way as to yield 
a total of four questions, two of which were remembering and two inferring. 
The operational definitions of these forms are derived from the work of 
Lynch and Ames (1971b). The most important distinguishing characteristic 
of remembering questions is that they call upon the child to recall some- 
thing. lie may be asked for. a straightforward summary of something or a 
piece of information, it does not call upon the child to interpret or 
draw conclusions from data. Inferring questions are distinguished by the 
fact that they call upon the child to arrive at his own interpretation, 
deduction, or conclusion from available information. A task in this cate- 
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gory asks the child to "go beyond thej data" and arrive at some sort of 
conclusion. The answer to one of the remembering questions and one of the 
inferring questions could be found in the first half of each section. 

The answer to the remaining remembering and inferring questions could be 
found in the second half of each section. 

After the story was re-written", it was recorded on audio tape. 
Several professional-story-tellers made recordings and the one which was 
considered to be the best with respect to 'voice' modulation, enthusiasm, 
word pronounciation, and general technical quality of the recording was 
finally selected. The story was put on tape and ultimately communicated 
to the experimental subjects through that medium in an attempt to reduce 
the variance which may have been accountable by the experimenters' reading 
speed, pronourciation, enthusiasm, etc. 

, X 

Rest Activity (Cuisenaire Rods) f 

f 

After listening to the story there was a rest period of fifteen 
minutes followed by the criterion test. During this fifteen minute period 
the experimenter engaged the child in an activity of playing with Cuisen- 
aire Rods . 1 

Though the original intent of the developers of the Cuisenaire 
Rods was to help in the teaching of mathematics, the varying lengths and 

I 

bright colors of the rods made them appropriate for making stick figures, 
pictures and three-dimensional objects. The intent was to use these mate- 



^Cuisenaire Company of America, Inc. 
9 Elm Avenue 

Mt . Vernon, New York 10550 
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rials in a carefree, non-threatening and relaxing manner. They were not 
used instructional ly in this study but only to provide a vehicle for fil- 
ling the fifteen minute period of time with an enjoyable, but non-tiring 
activity. 

Criterion Test 

The criterion test was administered fifteen minutes after the 
listening activity. There were two types of criterion test: remembering 

and inferring. Each test was composed of twfenty free recall type items 
in total, but each subject received only one type of test (i.e., either 
remembering or inferring but in no case both). 

Each criterion test consisted of two subtests: a relevant subtest 

and an incidental subtest. The relevant questions were those which had 
been asked, earlier during the listening task and as such had been heard by 
the subject once before. The incidental subtest contained items based upon 
the same story which the subject had listened to earlier, but were questions 
he had never heard before. All items were classified by type by two people 
who had demonstrated consistent reliability with the Lynch-Amcs (1971) 
category system. Copies of the criterion tests may be found in Appendix B. 

Experimenters ' 

Six people served as experimenters in this study. Five were fe- 
male; one was male. All were employed by the Center for Innovation in 
Teaching the Handicapped, Indiana University, Bloomington, Indiana, as 
either research assistants or associates and represented a yariety of 
backgrounds. During a training session preceding the collection of the 
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data, the experimental procedures were rigidly standardized and practiced. 
Each experimenter received a copy of the experimental procedures. A copy 

of the instructions to oxperimenters may be found in Appendix D. 

\ 

As a check, the author, listened to the tapes of the experimental 
proceedings for all of the experimenters after the first day of data col- 
lection. Though few variations from the standard procedure were found, 
several minor infractions were noted and the appropriate experimenters 
were informed. The tapes were checked again after the second day of data 
collection and all errors in procedure were found to have been eliminated. 

Procedures 

Subjects were randomly selected from cooperating classrooms and 
taken one at a timo to the experimental room by the experimenter. After 
establishing rapport with the subject, the experimenter indicated that 
they would be listening to a short mystory story which was played on a 
tape recorder. The experimenter introduced the story with the general 
statement "that the story is about a young boy who lives in a city and 
the problems he lias whon he tries to catch some bank robbers." Uc (she) 
was also told to expoct a surprise ending. The experimenter stressed that 
it was important to listen very carefully as ho was interested in how 
much they learned from the story and that they would be asked questions 
after the story was over. 

During the listening activity the experimenter systematically 
introduced tlte questions. The questions were typed on a sheet of paper 
which the experimenter held before him. The experimenters had been in- 



33 



structed to read the questions as they were printed. These questions 
were of two types (remembering or inferring) and in one of two positions 
(pro or post). In the pre- remembering questions condition the experi- 
menter asked a remembering question before the subject listened to each 
section. As in all other conditions, the subjects were instructed to 
respond to those questions orally after listening to each of the para- 
graphs. No feedback was given . The experimenter could only say Thank 
you or O.K., but could not indicate that the answer was right or wrong. 

The pre-inferring question group was asked one inferring questions before 
each paragraph was heard. In the post-question conditions the questions 
were asked one at a time but after each section had been heard. There 
were a total 6f ten paragraphs and thus, ten questions. The control groups 
recoived no questions but did listen to the same short story. Instead, 
they- had a short twenty second break between each paragraph during which 
they just sat quietly. All of the questions were of the free recall var- 
iety and the experimenter wrote down all of the subjects' responses ver- 
batim. 

After the story was over, there was a fifteen minute rest period 
during which the subject and experimenter played with the Cuisenaire Rods. 
Each subject was informed before the 15 minute break that after they played 
for awhile, they would have a short test. 

After the rest period the exporimenter administered the twenty- 
item criterion test. Each of the questions was typed on a sheet of paper 
which the experimenter had before him. The subjects' responses were again 
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written down exactly as spoken. Each item could be read no more than / 

J' 

two times in total. Feedback to these questions could be accepting (e.g, 
O.K., Thank you, Uh huh) or positive (e.g., good, fine, etc.). The ob- 
jective here was to keep the subject on task, relaxed and responding. 

All questions and answeriwere tape recorded during the experimental 

activity. v \^ 

\ 

After the test was over, the >ubjoct was thanked very much for 
his (her) cooperation and escorted back to the classroom. The subject 
was requested not to tell anyone about the story and that it would be a 
secret between them until the next day. Then he could talk about it if 
he wished. 

Scoring of Tests 

A random sample of answers which the experimenters wrote down 
were checked by two people independently, against the corresponding audio 
tape for accuracy of the written response. In as much as this showed the 
written response to be accurate more than 99% of the time, the scoring 
of the answers was done on the basis of the responses which were written 
down by the experimenters as opposed to the taped version of the subjects' 
responses . 

The answers to each question were judged in turn; i.e., all of 
the responses to a given question were judged before the answers to another 
question. Each response was compared to that on the answer key. The 
answer key was made up in advance of any judging and was based upon the 
information contained in the story. Each response could receive one of 
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three possible scores. An answer which was completely correct earned 
2 points; 1 point was given for half credit and 0 points for an incorrect 
response. All of tho answerswere judged in such a way that the judge had 
no knowledge of the condition in which the responder to the question was. 
After all answers were judged, an estimate of reliability was obtained 
by re-scoring a random sample of answers. This produced 94.5% agreement 
between the first and second scoring. The results of an item analysis 
for each criterion test may be found in Appendix E . 

Analysis of Data 

\ 

The data were analyzed in the following manner in order to test 
the stated hypotheses : 

(1) an item analysis was done on each of the two criterion tests 

(2) all independent and dependent variable were intercorrel&ted 
(computer program BMD02D) 

(3) tho responses to the ten training questions used during the 
listening task were analyzed by a serial analysis of variance 
on trials data (4 x 10 repeated measures on trials; computer 



program ANOVAR by D. J. Veldman, 1967). j 

(4) analysis of the total between trials score was accomplished 

\ 

with a 2x2x2 ANOVA to^th repeated measures on the last factor 
(position of information in section of the story) . The first 
two factors were question type and position (BMD08V) 

(5) a 2x2x2x6 ANOVA was used to analyze the criterion data aftei 
the fifteen minute delay.. -This, was done on the incidental an< 
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total score dependent variables without the control group 
(computer program RCC5V) 

(6) a 2x2x6 ANOVA was used to analyze the relevant delayed cri- 
terion data. The criterion type was dropped from this analysis 
as type of question is accounted for in factor ^nc^ndjiecause^ 
these are relevant scores^ the questions^must'^lways be of the 
same type (computer progranrRCC5V) 

(7) a 2x5jiN0VA'~was used to analyze the relevant and incidental 
scores with the control groxip. The first factor was criterion 
type and the second factor was groups (computer program RCCSV) 

(8) the relevant criterion measures without the control were com- 



pared in a 2x2x6x2 ANOVA (computer program RCCSV). 

(9) the relevant criterion measures with the control group added 
were compared in a 2x5 ANOVA (computer program RCCSV) . 

(10) the relevant and incident criterion measures were compared 
in a 2x5x2 ANOVA (computer program BMD08V) . 

(11) a Stepwise Multiple Regression Analysis (Nie et al., 1970) 
was done on oach dependent variable. Each analysis yielded an 
equation in the following form: 

Xq « bjXj ♦ ^ 2 X 2 ■*■...♦ b n X n ♦ R 



whero 



Xo = dependent variable score 
b^ » unnormal i zed coefficient 
Xi * independent variable 
R « residual 



*The RCCSV computer program adjusts for unequal cell sizes. 



37 



f 

f 

I 

I 



Stepwise regression is a powerful variation of multiple regression 
which provides a moans of choosing independent variables which will pro- 
vide the best possible prediction of the dependent variable with the fewest 
independent variables. The method recursively constructs a prediction 
equation for one independent variable at a time. The first step is to 
choose the single variable which is the best predictor. The second inde- 
pendent variable to be added to the regression equation is that which 
provides the best prediction in conjunction with the first variable. This 
procedure continues by adding variables step-by-step until the desired 
number of independent variables have been added or until no other variables 
will make a significant contribution to the prediction equation. At each 
step the optimum variable is selected from those variables which remain. 

In order for an independent variable to be added to the regression 
equation its regression coefficient (b, unnormalized) must be significant 
as measured by the F statistic and its tolerance level must be high enough 
to demonstrate that a new dimension is being added to the prediction equa- 
tion. Consequently, stepwise regression never brings a variable into the 
equation if the tolerance. is below a specified minimum. 

Planned comparisons were done on all statistically significant 
interaction results which were predicted by the hypotheses. Post hoc 
analyses were done on all significant results which were obtained but not 
hypothesized. The Neuman-Kuels test was used for the comparison of means 
involved in these interactions. 
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CHAPTER III 

RESULTS AND INTERPRETATION 

The results are presented here under threo main headings: results 

from between trials scores, results based upon the delayed criterion tests 
and results based upon data concerned with the establishing of mathcmagenic 
behaviors. A brief interpretation of the results follow the presentation 
of each set with a more general discussion of the relationships between 

i 

the three sets of results being reserved for Chapter IV. A- correlation 

matrix of all independent and dependent variables may be found in Appendix F 

| 

Between ^ Trials Results 

The mean score and standard deviation for each of the four experi- 
mental groups is presented in Table 2. The control group received no 
between trials questions and thus has no between trials score. Each mean 
is based upon 10 trials for 18 subjects. Inasmuch as a correct response 
on any trial would earn 2 points, the highest possible score for any sub- 
ject was 20 points with the lowest being 0 points. 

An analysis of variance was carried out with question type (re- 
membering and inferring), question position (pre and post), location of 
information in paragraph (either first half or second half of the paragraph) 
and experimenter as the main factors. This was a 2x2x2x6 ropeated meas- 
ure. The results are summarized in Table 3. 

In this analysis the main effect of question type was significant 
(F « 11.01, df * 1/48, p <.005). Those subjects who received remembering 
questions did significantly better than those receiving inferring questions. 
Though this result was not surprising, it was of relatively little impor- 
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Table 2 



Means and Standard Deviations of Between Trials Total Score 



Group 


Mean 


SD 


Remembering Pre-Question 


10.17 


S . 37 


Remembering Post-Question 


11. S6 


4.87 


Inferring Pro-Question 


4.78 


4.17 


Inferring Post-Question 


9.78 


4.40 



% 
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Table 3 

Analysis of Variance of Between Trials Total Score 



Source 


1 

df 


MS 


F 


P 


Question Type (T) 


1 


112.01 


11.01 


<.005 


Question Position (P) 


1 


88.67 


8.72 


<.005 


Experimenter (E) 


5 


5.52 


<1 




Location of Information 




• 






in Section (I) 


1 


1.56 


1 




T x P 


1 


27.56 


2.71 


n.s . 


T x E 


5 


4.29 


<1 




P x E 


5 


2.76 


<1 




T x I 


1 


.01 


<1 




P x I • 


1 


39.06 


14.92 


<.001 


E x I 


5 


2.11 


<1 




T x P x E 


5 


42.75 


4.20 


< .005 


T x P x I 


1 


2.51 


1.16 


n.s. 


T x E x I 


5 


2.59 


, <1 




P x E x I 


5 


4.25 


1.62 


n.s. 


S(T x P x E) 


48 


10.17 






T x P x E x I 


5 


2.19 


<1 




SI(T x P x E) 


48 


2.62 
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tanco here as the effect of question type was only of interest in this 
study as it interacted with criterion type. The result does suggest, 
however, that these inferring questions as a group, wore probably more 
difficult than the set of remembering questions. 

The analysis also showed the main effect of question position to 
bo significant (F = 8.72, df * 1/48, p {.005) with those subjects receiving 
post -quest ions getting significantly higher scores (X « 10.67) than those 
subjects who received pre-questions (X =• 7.47). This finding was in the 
opposite direction of what was predicted. 

The hypothesis that children would do better on those items where 
the information necessary to answer the question was close, as opposed to 
distant from the question, was not supported as there was no significant 
main effect due to the location of information in the paragraph (F ■ <1, 
df » 1/48). The interaction between question position and the location 
of answer in the section, however, was highly significant (F = 14.92, df ■ 
1/48, p < .001; see Figure 2 for the graph of this interaction). Results 
of the planned comparison test showed that subjects did better on items 
where they received a post-question and the information needed to answer 
the question was in the last half of the section than on items where they 
received a pre-question and the information was in the first half of the 
section (p { .005). On items where the information needed to answer the 
question was presented in the first half of the section, subjects who re- 
ceived post-questions performed better than those who received pre-questions 
(p ( .05). On items where the information needed to answer the question 
was presented in the last half of the section, subjects who received a 




a 




Close Distant 

Relativo position of information 
in section of story 
to training position 



Figure 1 Graph of means of the interaction between location 
of information in sections of the story and 
position of question. 
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post-question out performed those subjects who had received pre-questions 
(p C .05) . 

Results of the analysis of variance also showed the interaction 
of question type, question position and experimenter to be significant 
(F = 4.20, df =* 5/48, p C .005). See Figure 3 for the graph of this in- 
teraction. The Ncuman-Keuls test was used in order to establish which 

\ 

means were significantly different from each other. Results of this com- 
parison showed experimenter 5 (E^), when asking post -remembering questions, 
to he able to get higher achievement from tho children than experimenters 

1, 2, and 6 when they asked pre-inferring questions (p < .01). The sub- 

\ ^ 

jects of Eg who received post- remembering questions also did better than 

vT 

the subjects of Eg, E 4 , Eg who received pre-inferring questions (p < .05), 
better than the subjects of E 5 and E^ when they received post-inferring 
questions (p < .05) and better than Eg and Eg when their subjects received 
pre-remembering questions (p < .05). Children who received post-inferring 
questions from E 2 did better than children wlio received pre-inferring 

X 

questions from E|, E 2 , or Eg (p < .05). The children who receiy.jpd post- 
remembering questions from did better than the children of E 2 who re- 
ceived pre-inferring questions (p < .05) and subjects who received post- 
inferring questions from Ej did better than those children who received 
pre-inferring questions from E 2 (p <. .05). ' \ 

These differences between experimenters may be accounted for in 

V 

several ways. A first possibility is that chance alone may have accounted 

; j 

'lr 
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Figure 2. Graph of the raean scores involved in the interaction 




'-1 



\ ’ 

for the observed differences. A second is that although a t-test of means 
showed the experimental groups not to be significantly different from each 
other with regard to mental age, it is possible (though unlikely) that 
they were not randomly assigned to treatment groups and, in fact, differed 
on some variable which was not measured. The third explanation which 
could be offered is that there were in fact real differences in the be- 
havior of experimenters which accounted for the significance of the 3-way 
interaction. While this might be true, of the IS means which were found 

t 

to be significantly different, differences among 9 of these were likely 

' X 

/ 

due to the factor of criterion- type. The remaining 6 may indeed be due 
to experimenter effects, but it would be risky at best to attempt to hy- 
pothesize the exact nature of these experimenter behaviors. The lack of 
any other significant interactions involving experimenters, the lack of 
a main effect involving experimenters, and the lack of any clear trend or 
influence of experimenters in the 3-way interaction would make this sheer 
speculation. 

A stepwise multiple regression analysis was dono on each of three 

dependent variables: (1) subjects' scores on items where the question 

/ 

and information needed to answer the questions were in close proximity 

\ 

to each other, (2) subjects' scores on tiems where the question and in- 
formation necessary to answer the question were distant from each other, 

.. and (3) the sum of the two preceding scores or the total betwecn-trials 
score. 

The following independent variables could be included in the re- 
gression equation depending upon the significance of the variable (i : sta- 
tistic) and its tolerance level: question type, question position, mental 
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age, chronological age, sex, race of child, race of school and socio-economic 
status of the school which the child was attending. The results of these 
regression analyses are/ summarized on Tables 4, 5, and 6. 

In looking at Table 4 (prediction of close proximity scores) it 
can be seen that question position, mental age and question type account 
for over 41% of the variance with question position and question type to- 
gether accounting for 3 times as much variance as mental age. Though both 
question type and question position showed significant main effects in the 
analysis of variance, it seems that question position is the more powerful 
variable of the two, at least in its relationship to scores on close prox- 
imity items. Sex, race of the child and socio-economic status reached sta- 
. ( 

tistical significance but do not appear to add appreciably to the precision 
of the regression equation. Predominant race of the school attended by 
the child and chronological age were not included in the equation. 

In predicting scores on distant" proximity items (Table 5) mental 
age and question type continue to account for approximately 10% of the 
variance each, but question position contributes only slightly more than 
1.5V of the total variance. This is a drop from almost 21% of the variance 
in predicting close proximity scores. A total of 31% of the variance was 
accounted for in predicting scores on those items where the question and 
information needed to answer tho question were separated by at least half 
of the section. 

In predicting the total between- trials score (Table 6), 39.39% 
of the variance was accounted for with all variables entered being, included 
in the regression equation. Question type, mental age and question position 
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Table 4 

Stepwise Multiple Regression Analysis on 
Close Proximity Scores 



Independent 

Variable 


R Square 


B 


Standard 
Error B 


F 

(at- last step) 


Question Position 


.2074 


2.8402 


.5468 




Mental Age 


.3069 


0.0725 


.0206 


- 


Question Type 


.4188 


-1.9555 


.5570 




Sex 


.4206 


0.3103 


.5768 




Race of Child 


.4229 


0.2788 


.5740 




SES of School 


.4232 


-0.0953 


.5590 


7.95* 


(Constant) 




-4.7457 


• 





*df ■ 6/65, p < .001 



Variables not included : 
Race of School 
Chronological Age 
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Table 5 

Stepwise Multiple Regression Analysis on 
Distant Proximity Scores 



\ 



Independent 

Variable 


R Square 


D 


Standard 
Error B 


F 

(at last step) 


Mental Age ' 


.1113 


.0839 


.0273 




Question Type 


.2156 


-2.0766 


76384 




SES of School 


.2556 


-1.2735 


.6748 




Race of School 


’ .2805 


1.2629 


.8824 




Question Position 


.2983 


.7198 


.6193 




Chronological Age 


.3081 


.0314 


.0339 




Race of Child 


.3115 


.4816 


.8578 


4.13* 


(Constant) 




3.6627 







*df ■ 7/64, p < .001 



Variables not included: 



Sex of Child 
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Table 6 

Stepwise Multiple Regression Analysis on 
Total Between Trials Score 



Independent 

Variable 


R Square 


B 


Standard 
Error B 


F 

(at last step) 


Question Type 


.1161 


-4.0402 


1.0804 




Mental Age 


.2479 


.1634 


.0461 




Question Position 


.3640 


3.5872 


1.0516 




SES of School 


.3789 4 


-1.2385 


1.1411 




Race of School 


.3876 


1.1340 


1.4932 




Chronological Age 


.3930 


-.0439 


.0575 




Sex of Child 


.3938 


.3033 


1.1081 




' Race of Child 


.3939 


-.1696 


1.4602 


5.12* 


(Constant) 




-.1117 







/ 

*df ■ 8/63, p < .001 




i 

i 

/ 



\ 




59 



50 



continued to account for the major share of the accountable variance 
(36.40%) with each variable now contributing approximately equal amounts. 

These data on short term retention and achievement suggest question 
position to bo an important variable in influencing the learning of edu- 
cable mentally retarded children. As a main effect subjects who received 
post-questions did consistently better than those who received pre-questions. 
This of feet, though significant, was in a direction opposite to that of 
the prediction. It was hypothesized that the pre-question condition con- 
dition would cue the subject to listen for the appropriate information 
(i.e., the answer) and thus enhance his retention of the critical informa- 
tion. This effect, it was thought, would be superior to the effect of 
tho expectation held by others who wore receiving post-questions, that 
they wou]d be askod a question about the section of the story they had 
just heard. 

The significant interaction between question position and location 
of the critical information in the section of the story seoms to offer 
data in support of an explanation for this finding. Soe Figure 2 (p.44) for tnc 
graph of this interaction. A comparison of the scores on tho post-question 
close-proximity items with the pre-question close-proximity items provides 
the clearest example of the interaction. Subjects seemed to do better 
when both the quostion and information were in close temporal proximity 
to the time when the response was called for. The group which had both 
the question and critical information furthest removed from the call for 
the responso did tho worst. A comparison of the moans involved in this 
interaction shows that in no caso did a pre-question group perform better 



than a post -question distant group and the pre-question close group offers 
further support for the importance of the question position variable. 

Though the appropriate interaction of question position and location of 
information provides the most power in influencing retention, location of 
critical information relative to ithe request for the response to the ques- / 
tion is not a powerful variable when taken alone. This is further supported 
by the lack of a significant main effect of location of information in the 
analysis of variance (Table 3). 

A further comparison of the interaction means suggests that the 
information in the story may be more easily remembered than the questions 
which preceded the individual sections. The significant difference between 
the post-question distant items and the pre-question close proximity items 
in favor of the post-question distant items suggests that even when the 
information necessary to answer the question is furthest removed from the 
request for the response, subjects do better than when the question pre- 
cedes the section and is quickly followed by the response. 

The pre-question, close proximity situation seems to offer the 
greatest opportunity for the interference of new information to influence 
forgetting. Thus the information which follows the question, but before 
the answer serves to interfere with the retention of the question. And 
then the information which follows the critical information (i.e., the 
answer) again serves to interfere with the retention of this critical in- 
formation. In the post-question conditions this interference between in- 
formation and question presentation does not have a chance to operate. 

The explanation of the findings offered by this interference hypothesis 
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is further supported by the difference (though not statistically signif- 
icant) in favor of the post-question close-proximity items over the post- 
question distant proximity items. 

The application of the short-term memory deficit (Kills , 1963, 

1970) hypothesis may also be used to explain those results as subjects 
(1) did consistently hotter in the post-question condition, (2) showed 
better retention where the question was a post-question and the informa- 
tion necessary to answer the question was in the second half of the section 
as opposed to the pre-question group with the information in the first half 
of tho section. In this last group the greatest amount of time exists 
between the time of the presentation of the information and the request 
for the response; thus, creating the greatest opportunity for forgetting. 

The former group of course, creates the least opportunity for forgetting. 

In order to test which of these hypotheses offors tho best explan- 
ation for the observed phenomenon, further research would need to be done. 

By controlling the nature of the activity between the presentation of the 
critical information and the call for the response (i.e., by continuing 
to present information or just allowing time to lapse) one would be able to 
identify tho relative effects of forgetting through interference, and for- 
getting through the passage of time alone. 

In summary, it seems clear that question position is an important 
variable in influencing the shortterm retention of educable mentally re- 
tarded children. And, moreover, that the best results on an immediate 
retention test may be obtained under conditions where there is close temporal 
contiguity between the question and the presentation of the information 
required to provide the correct answer. 
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Delayed Critertion Test Results and Interpretation 

Results to be reported in this section are concerned with subjects' 
performance on the delayed (after 15-minute rest period) criterion test. 

There were 20 items on this criterion test. Ten of these items were "rele- 
vant" items -- the questions which were asked by the experimenter during 
the listening activity, and 10 were "incidental" items -- questions relating 
to the material covered in the story but not asked during the listening 
activity. Thus the criterion test was composed of 2 subtests. 

Though the control groups received all 20 items as the criterion 
test, it must be pointed out that in fact, they have no "relevant" items. 
This is a result of their having received no training questions during the 
listening activity. The same situation exists for those groups who received 
one type of training question and another type of question Oii the criterion 
test. Because the questions they received on the criterion test were of 
a different type than those ‘they heard during the listening activity, they 
really have no "relevant" criterion test questions either. These clari- 
fications in mind, it must be remembered that the term relevant is used to 
describe a certain subsot of 10 items on the criterion test and not any 
treatment conditions. The same applies for the term incidental which refers 
to the remaining set of 10 items. These 10 items were never heard by any 
of the subjects but nevertheless pertain to information presented in the 
10 sections of the story. 

A 2x2xSx2 analysis of variance was used to compare the relevant 
delayed criterion test scores without the control group. The main factors 
involved in this analysis wtere question type (remembering and inferring) , 
question position (pre and post), experimenter (6 levels) and criterion 

/ 
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type (remembering and inferring). The means and standard deviations for 
each of the ten experimental groups may be found in Table 7. Each mean 
was derived from 18 subjects and based upon 10 questions. 

The results of this analysis, summarized in Table 8, showed a signifi- 
cant main effect of criterion type (F « 10.71, df - 1/24, p < .005). Closer 
examination of the means indicated that subjects did significantly better 
on the remembering criterion test than on the inferring criterion test. 

This finding was consistent with results of the between trials analysis 
where a significant difference was found with respect to question type. 

The results of the analysis also showed a significant interaction 
between question type and criterion type (F « 9.70, df * 1/24, p < .005). 

See Figure 3 for a graph of this interaction. Results of a planned compar- 
ison test of means showed that subjects who received remembering training 
questions and a remembering criterion test performed better than those sub- 
jects who received inferring training questions and the remembering criterion 
test (p < .005); and that subjects who received inferring training questions 
and an inferring criterion tost did better than those subjects who re- 
ceived remembering training questions and the inferring criterion test 
(p < .05). These findings were in the predicted direction and confirm 
hypotheses 10 and 11. Other means were also found to he significantly dif- 
ferent f-om each other: subjects who rocoived remembering training questions 

and a remembering criterion test did bettor than the group which received 
remembering training questions and inferring criterion tost (p < .005) 
and the group which received inferring training questions and inferring 
criterion test (p (, .01), and subjects who received inferring training ques- 
tions and a remembering criterion test did better than those subjects who 
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Table 7 

Means and Standard Deviations of Relevant 
Delayed Criterion Test Scores 
(Based on a possible total of 20 points) 



Group 


Mean 


SD 


Criterion Test 1 (Remembering) 






Remembering Pre-Questions 


13.55 


3.71 


Remembering Post-Questions 


11.67 


5.12 


Inferring Pre-Questions 


, 7.88 

\ 


4.40 


Inferring Post-Questions 


9.22 


3.23 


Control (No Questions) 


10.22 


4.42 


Criterion Test 2 (Inferring) 






Remembering Pre-Questions 


5.44 


2.92 


Remembering Post-Questions 


6.11 


1.83 


Inferring Pre-Questions 


3.78 


3.53 


Inferring Post-Questions 


8.66 


4.69 


Control (No Questions) 


8.11 


5.46 
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Table 8 

Analysis of Variance, of Relevant Delayed Criterion 
Test Scores without Control Group * 



Source 


df 


MS 


F 


P 


Question Type (T) 


1 


7.56 


< 1 




Question Position (P) 


1 


1.17 


< 1 




Experimenter (E) 


5 


19.88 


1.13 


n.s . 


Criterion Type (C) 


1 


189.06 


10.71 


<.005 


T x P 


1 


8.51 


< 1 




T x E 


5 


19.41 


1.10 


n.s . 


TxC 


1 


171.17 


9.70 


<.005 


P x E 


5 


4.02 


<1 




P x C 


1<> 


.06 


<1 




E x C 


•' • 5 


7.75 


<1 




T x P x E 1 


5 


11.26 


<1 




T x P x C 


1 


27.56 


1.56 


n.s . 


T x E x C 


5 


8.62 


<1 




P x E x C 


5 


18.45 


1.05 


rt.s . 


T x P x E x C 


5 


16.25 


<1 




Error 


24 


17.65 






TOTAL 


71 









These data were analyzed in a 2x2x2 ANOVA as well, eliminating 
the experimonter factor. This was done in order to allow the RCC5V pro- 
gram to analyze the same data using equal n's. The results of this an- 
alysis showed the criterion type main effect to be significant at tho 
p < .005 level as woll as the TxC interaction (p < .005). All other ef- 
fects were nonsignificant. 
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received remembering training questions and an inferring criterion test 
(p <. .05). These last three comparisons, however, though significant, may 
be accounted for by differences in apparent difficulty' in the criterion 

V 

test and thus are not of particular interest here. 

hypothesis 6 predicted that children who received pre-questions 
would do better on the relevant criterion test than children who received 
post-questions. The lack of a significant main effect of question position 
requires the rejection of this hypothesis. 

A 6x6 analysis of variance was carried out in order to compare /the 

/ 

relevant criterion test results with the control groups included. The main 
factors were 6 levels of experimenter and 6 levels of group. With the ex- 
ception of the control groups, who of course received no training questions, 
only those subjects who received the same type of criterion question as 
training question were included in this analysis of 36 experimental subjects 
and all 18 of the control subjects. 

The results of this analysis (as summarized in Table 9) indicated 
that there were no significant differences between groups, experimenters 
o^ in the interaction of these two variables. Thus Hypothesis 4, which 
predicted that children who received questions would do better on the 
relevant criterion test which asked the- same questions as were in the train- 
ing, than children who were in the control group and did not receive train- 
ing questions, was not supported by the data. Yet previously presented 
data suggest that there is indeed an interaction between question typo and 
criterion type. A careful inspection of the means (Table 7) offers a 
possible explanation of these two findings. 
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fable 9 

Analysis of Variance of Relevant Delayed Criterion 
Test Scores with Control Group 



Source 


df 


MS 


F 


P 


Groups (G) 


5 


42.31 


1.73 


n.s. 


Experimenter (0) 


5 


29.53 


1.21 


n.s. 


G x E 


25 


15.63 


<1 


_ — — 


Error 


18 


24.44 






TOTAL 


53 
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While the experimental groups which received the same type training 
question as criterion question did better than the control subjects, those 
subjects who recoivcd training questions and criterion questions which were 
not of the same type seem to have done worse than the respective control 
groups. Thus, while it can not be said that questioning in general produces 
better relevant retention after the 15 minute delay, it does seem clear that 
the best retention is obtained under conditions where the questions which 
are asked are of the same type as the criterion test questions. Thus, if 
questions arc to be asked during instruction, it seems clear that attempts 
must be made to maintain consistency between the level of questions asked 
during the training period and those to be included on a criterion test. 

Table 10 contains the means and standard deviations of all incidental 
delayed criterion test scores. Each of these scores is derived from 9 
subjects and based upon 10 items. 

A 2x5x2 analysis of variance was carried out on the relevant and 
incidental delayed criterion subtest scores with the control groups included. 
The main factors were criterion type (remembering and inferring), groups 
(S levels) and subtests (relevant and incidental). A summary of these re- 
sults may be found in Table 11. As before, the criterion type main effect 
was found- to be significant (F = 7.50, df » 1/16, p < .OS) with children 
who received the remembering criterion test doing better than those who re- 
ceived the inferring criterion vest. Significant interaction.? were also 
found between criterion type and groups (F ■ 4.80, df » 4/64, p < .01) end 
groups and the subtests ( F » 3.42, df » 4/64, p < .05). 

With respect to tho interaction of criterion type and groups, a test 
of planned comparisons showed the following moans to bo significantly dif- 
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Table 10 

Means and Standard Deviations of Incidental 
Delayed Criterion Test Scores 
(Based on possible total of 20 points) 



Group 


Mean 


SD 


Criterion Test 1 (Remembering) 






Remembering Pre-Questions 


6.44 


2.88 


Remembering Pos ^Questions 


6.33 


3.46 


Inferring Pre-Questions 


6.33 


2.69 


Inferring Post-Questions 


7.00 


4.56 


Control (No Questions) 


6.11 


3.41 


Criterion Test 2 (Inferring) 






Remembering Pre-Questions 


3.56 


3.28 


Remembering Post-Questions 


6.11 


5.37 


Inferring Pre-Questions 


7.67 


3.77 


Inferring Post-Queations 


6.89 


4.96 


Control (No Questions) 


4.67 


2.40 
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✓ Table 11 

Analysis of Variance of Relevant and Incidental 
Delayed Criterion Subtest Scores with Control Groups 



Source 


df 


MS 


F 


P 


Criterion Type (C) 


1 


133.47 


7.50 


<•05 


Groups (G) 


4 


5.91 


<1 




Subtests (RI) 


1 


29.61 


<1 




Error S (RI) 


16 


50.19 






C x G 


4 


58.75 


4.80 


<•01 


C x RI 


1 


2.45 


<1 




G x RI 


4 


45.13 


3.42 


<•05 


Error CS(RI) 


16 


17.78 






Error GS(RI) 


64 


13.21 






C x G x RI 


4 


13.89 


1.14 


n.s. 


Error CGS(RI) 


64 


12.24 
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fcrent from each other: on the remembering criterion test children who 

received remembering pre-questions did better than those children who re- 
ceived inferring pre-questions (p < .05); on the inferring criterion test, 
the children who received inferring pre-questions did better than those 
children who received remembering pre-questions (p < .05) and children who 
received inferring post-questions did significantly better than those chil- 
dren who received remembering pre-questions (p < .05). See Figure 4 for a 
graph of this interaction. Significant differences were also found between 
criterion types in the pre-question condition, with those receiving the 
remembering criterion test doing better than those who took the inferring 
criterion test (p < .005). And again, the pre-question group which received 
the remembering criterion test did better than the remembering post-question 
group that received the inferring criterion test (p 4 .05). These last two 
interactionswere thought to be significant as a result of the unequal diffi- 
culty of criterion test items, and thus, not of great interest here. No 
significant differences were found in comparing treatment groups with the 
control group on the same criterion test type for either the remembering or 
inferring criterion tests, or between control groups. 

In looking at the interaction of group by subtest (relevant and in- 
cidental), the planned comparision test of means showed several differences 
to be significant (See Figure 5 for graph of this interaction). Those sub- 
jocts who received inferring post-questions did significantly better on the 
incidental criterion test than did those who received the same type of train- 
ing question on the relevant criterion test (p < .005) and those who received 
inferring pre-questions on the relevant test (p < .05). In comparing control 
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Figure 4. Graph of the interaction between criterion type and groups. 
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groups on the relevant and incidental criterion tests, no significant dif- 
ferences were found. This finding suggests the relevant and incidental sub- 
tests to be of equal difficulty. The tost of means also showed the post- 
inferring group to do significantly better on the incidental subtest than 
either the post -remembering group (p < .005), the incidental control group 
Cp <. .005) or the p re -remembering group (p < .01) did on the incidental cri- 
terion subtest. In comparing relevant subtest scores, the post- remembering 
group was found to have done significantly better than the post-inferring 
group (p < .05). 

A close look at the graph of this interaction (Figure 5) shows prac- 
tically a mirror image difference between the relevant and incidental scores. 
Kith the exception of the pre-question remembering group and the control 
group, the scores are just the opposite of each other with respect to treat- 
ment groups. The most interesting results arc the differences between the 
post-inferring groups on the incidental subtest as compared with all the 
others. The difference between this group on the incidental and relevant 
subtests, for example, suggests that by asking post-inferring questions one 
cannot hope to increase both incidental and relevant learning and that, in 
fact, in asking post-inferring questions one significantly increases the in- 
cidental learning while tending to suppress relevant learning (difference 
between this group and control on relevant learning in favor of control group 
though not statistically significant). 

The significantly better performance of the post-inferring group on 
the incidental subtest as compared with the performance of the control, pre- 
remembering and post-remembering* groups on the incidental subtests suggests 
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the asking of post-inferring questions to be an effective technique for 
increasing incidental learning. 

The asking of post-remembering questions also seems to be a better 
way of producing relevant learning than the use of post-inferring questions. 

The fact that the relevant means of these two groups differ significantly from 
each other but not from the relevant control mean suggests a relatively small 
facilitating effect for the post -remembering questions and a relatively small 
suppressing effect for the post-inferring questions on relevant learning. 

Table 12 contains the summary data from the 2x2x6x2 analysis of vari- 
ance data using total criterion test score as the dependent variable. The 
main factors in this analysis were question type (2 levels), question posi- 
tion (2 levels), experimenter (6 levels) and criterion type (2 levels). A 
significant main effect of criterion type was found (F « 4.26, df = 1/24, 
p < .05) with children continuing to do better on the remembering than on 
the inferring criterion test. A significant interaction between training 
question type and criterion type was also indicated (F ■ 4. 54, df = 1/24, 
p < .05). See Figure 6 for the graph of this interaction. The planned 
comparison test of means showed the remembering training question - remembering 
criterion type group to have done significantly better than the remembering 
training question - inferring criterion type group (p < .005). This again 
was thought to have been due primarily to differences in degree of difficulty 
between criterion types. The mean of the remembering training question - 
inferring criterion test group was also found to be significantly lower than 
the means of both the inferring training question - inferring criterion test 
/group (p < .05} and the inferring training question - remembering criterion 




77 






68 



Table 12 

Analysis of Variance of Total Delayed Criterion 
Test Scores' 'Without Control Group 



Source 


df 


MS 


F 


P 


Question Type (T) 


1 


6.67 


<• 1 




Question Position (P) . 


1 


10.56 


< 1 




Experimenter (E) 


S 


43.97 


< 1 




Criterion Type (C) 


1 


232.56 


4.26 


<.05 


T x P 


1 


2.51 


<1 




T x E 


S 


73.87 


1.35 


n.s. 


T x C 


1 


248.06 


4.54 


< .05 


P x E 


S 


19.73 


< 1 




P x C 


1 


.56 1 


< 1 




E x C 


S 


38.96 


< 1 




T x P x E 


S 


62.64 


1.15 


n.s . 


T x P x C : 


1 


98.34 


1.80 


n.s. 


T x E x C 


S 


30.43 


< 1 




P x E x C 


S 


58.63 


1.07 


n.s. 


T x P x E x C 
Error 
TOTAL 


S 

24 

71 


33.91 

54.65 


.62 
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type group (p < .05), thus offering partial confirmation of hypothesis 10 -- 
that the total score criterion test results would be highest for children 
receiving the same type of criterion test as training question. 

The results of analyses in this section necessitated the rejection 
of two hypotheses. A planned comparison test of the control group incidental 
means and training question groups' incidental means indicated no signifi- 
cant differences and thus required the rejection of hypothesis 5 -- that 
children who received questions would do better on the incidental criterion 
test than children who were in the control group and did not receive train- 
ing questions. 

The lack of a main effect difference for subtests (RI) in Tabic 11 
required the rejection of hypothesis 9 -- that children would do better on 
the relevant subtest than on the incidental subtest. 

Tables 13, 14, and 15 contain the results of the stepwise multiple 
regression analysis on the dependent variables of relevant delayed criterion 
test scores, incidental delayed criterion test scores and total delayed 

criterion test scores respectively. 

\ 

The results of the regression analysis on the relevant criterion 
test scores (Table 13) showed that a total of 41.54% of the variance was 
accounted for with all of the 9 independent variables entered. Mental age 
accounted for the greatest amount of variance for a single variable (15.19%) 
with criterion type accounting for approximately 15% of the variance. 
Question position accounted for the least amount of variance of all the 
variables entered (.06%). 

In predicting the incidental criterion test scores (Table 14) the 
multiple regression analysis accounted for a total of 32.76% of the variance. 

\ . 
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Table 13 

Stepwise Multiple Regression Analysis on Relevant 
Delayed Criterion Test Scores 



Independent 

Variable 


R Square 


B 


Standard 
Error B 


F; 

(at last step) 


Mental Age 


,1519 

j 


0. 1552 


.0381 




Criterion Type 


.2816 


-3.4550 


.8762 




Chronological Age 


.3670 


-0.1121 


.0481 




SliS of School 


.4033 


-1.6789 


.9412 




Question Type 

( 


.4088 


-0.6677 


.8918 




Sex 


.4116 


0.5913 


.9140 




Race 


.4141 


0.6974 


1.2189 




Race of School 


.4148 


-0.3406 


1.2369 




Question Position 


.4154 


0.2155 


.8676 


4.90* 


(Constant) 




16.4287 







^*df = 9/62, p < .001 



I 

* 
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Table 14 

Stepwise Multiple Regression Analysis on Incidental 
Delayed Criterion Test Scores 



Independent 

Variable 


R Square 

1 


B 


Standard 
Error B 


1 F 

(at| last step) 


Mental Age 


/ .1710 

/ 


.1414 


.0365 




Sex 


/ 

/ .2530 


-1.8994 


.8773 




Race of School j 


' .2771 


1.8737 


1.1874 


' 


Chronological Age/ 

j 


.2979 


-0.Q650 


.0462 




Question Type / 

j 


.3077 


0.8714 


.8561 




Race of Child/ 


.3156 

/-* 


-1.1581 


1.1701 


\ 

\ 


Criterion Ty£e 


.3222 


-0.6424 


.8412 


Question/Position 


.3267 


0.5531 


.8329 




SES of School 


.3276 


-0.2509 


.9035 


3.36* 


(Constant) 




2.5011 







•df • 9/62, p < .005 



Table IS 



Stepwise Multiple Regression Analysis on Total 
Delayed Criterion Test Scores 



Independent 

Variable 


R Square 


B 


Standard 
Error, B 


F 

(at last step) 

1 


Mental Age 


.2118 


.2966 


.0649 






Criterion Type 


.2687 


-4.0974 


1.4938 






Chronological Age 


.3440 


-0.1771 


.0820 






Sex 


.3550 


-1.3081 


1.5581 






SI-S of School 


.3671 


-1.9298 


1.6046 






Race of School 


.3736 


1.5331 


2.1087 






Question Position 


.3764 


.7686 


1.4791 






Race of Child 


.3768 


-0.4608 


2.0781 






Question Type 


.3770 


0.2036 


1 . S2d3 


4 


.16* 


(Constant) 




18.9299 




* 





T 



*df » 9/62, p < .001 
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Mental age continued as the single most powerful variable (17.10%) with 
sex contributing the second most (8.20%). Question typo and position; 
though having statistically significant contributions added little to the 
precision of the regression equation. The same observation holds for the 
influence of criterion type. 

The results of the regression analysis on the dependent variable 
°f total score (Table 15) shows a total of 37.70% of the variance being 
accounted for. Mental age contributes over 21% to this total with criterion 
type being the second most important variable (5.79%). Again question type 
and question position, though making statistically significant contributions 
to the total variance accounted for, contribute little in terms of in- 
creasing the precision of the equation. 

The most striking observation in comparing these regression analyses 
with those of the between trials scores is the difference in the power of 
the question position variable in the short term task relative to the de- 
layed criterion task. It appears as if the more remote the criterion task 
from the learning task, the less important the position variable in influ- 
encing learning outcomes. 

Results and Interpretation with Respect to Mathcmagcnic Behaviors 

Two different 'methods were used in an attempt to test the hypothesis 
that questions may facilitate the development of mathcmagcnic behaviors and 
result in the establishing of learning sets or, more specifically, in this 
study, listening skills. It was predicted that if questions do have this 
facilitativc effect then two things would happen. First, the subjects' 
performance on the between trials items would improve as the listening 

/ 

/ 
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task procedcd. And secondly, that the post-question conditions would have 
a more powerful influence on the incidental delayed criterion test scores 
than the pre-question conditions, when the criterion test was of the same 
type as the training questions. 

The mean incidental score and standard deviation for each of the 
8 experimental groups and 2 control groups is presented in Table 10. Each 
mean was derived from 9 subjects and based upon 10 questions. These were 
the 10 items in the 20-item criterion test yhich the subject had never 
heard before. They were not training questions though they did pertain to 
information covered in the story. Inasmuch as a correct response on any 
trial would cam 2 points, the highest possible score for any subject was 
20 points with the lowest being 0 points. 

A 2x2x6x2 analysis of varianco was used to analyze these incidental 
delayed criterion test scores. The factors were question type (remembering 
.and inferring), question position (pro and post), experimenter (6 levels), 
and criterion type (remembering and inferring). The (results of this analysis 
are summarized in Table 16. 

This analysis showed no main effects or interactions to be signif- 
icant at the p < .05- level of significance. The main effect of question 
position did not show post-questions to produce better incidental score 
\ performance than pre-questions. < 

Nor was the prediction that the interaction of question type, question 

, position, and criterion type would be significant found to be in the predicted 

< 

direction. Thus, it must be concluded from those results that the training 
questions wero not significant in influencing the development of mathemagenic 
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■. Table 16 

Analysis of Variance of Incidental Delayed Criterion 
Test Score without Control Group 



Source 


df 


MS 


F 


P* 


Question Type (T) 


1 


28.44 


1.77 


j 

n.s . 


Question Position (P) 


1 


4.69 


<1 




Experimenter (E) 


5 


.7.14 


<1 




Criterion Type (C) 


1 


2.25 


<1 




T x P 


1 


1.78 


<1 




T x E 


5 


23.06 


1.43 


n.s . 


T x C 


1 


7.11 


* <1 




P x E 


S 


19.84 


* 1.23 


n.s . 


P x C 


1 


.;:s 


<1 




E x C 


5 


13.40 


<1 




T x P x E 


5 


32.56 


2.03 


n.s . 


T x P x C 


1 


21.78 


1.35 


n.s . 


T x E x C 


5 


12.56 


<1 




P x E x C 


5 


17.37 


1.08 


n.s. 


T x P x E x C 


5 


9.06 


<1 


* 


Error 


24 


16.08 






TOTAL 


71 




\ 

\ 





# p < .05 \ 
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behaviors or listening skills. These results require the rejection of 
Hypothesis ^5. 

The results of the analysis of variance in Table 17 show the ex- 
perimental groups not to be significantly different from the control groups 
with respect to the dependent variable of incidental delayed criterion test 
score. Thus, it appears that after the 15 minute delay between listening 
to the story and the criterion test, the experimental groups which received 
training questions were at no great advantage in answering the delayed 

incidental criterion test questions than the control groups which received 
no training questions at all. 

A serial analysis of between trials scores was done in an attempt 
to identify possible learning to learn curves. It was reasoned that if 
subjects were learning how to listen throughout the listening task, that 
their performance would imporve across trials. In addition, the serial 
analysis by trials could indicate if there was any depreciation in per- 
formance -- possibly as a function of the length of the task -- if the sub- 
jects' performance was poorer at the end of the listening task. While an 
analysis of variance was done on these data, and reported in Table 18, it 
must be noted that the results of this analysis should be viewed with caution 
for it was not established that all items (trials) were of equal difficulty. 
Indeed, there is evidence to suggest that all the items were not of equal 
difficulty (e.g., main effect difference of question type in Table 3) and 
all of the effects significant in Table 18 may be accounted for by differ- 
ences in item difficulty. As a result, it appears to be of greater value 
to look at the relationships between question position given the same question 
type. Those relationships are graphed in Figures 7, 8, and 9. 
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Table 17 

Analysis of Varianco of Incidental Delayed Criterion 
Test Scores with Control Group 



Source 


df 


MS 


F 


P* 


Criterion Type (C) 


1 


10.00 


<1 




Groups (G) 


4 


14.64 


1.02 


n.s. 


C x G 


4 


11.31 


<1 




Error 


80 


14.41 




• 


TOTAL 


89 









*p < .05 
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Table 18 

Serial Analysis of Variance of 
Between Trials Scores 



Source 


df 


MS 


F 


P 


Between 


71 


2.80 






Groups (G) 


3 


t 

15.22 


6.76 


< .001 


Error (G) 


68 


2.30 






Within 


648 


OJ 

.70 






Trials (T) 


9 


2.80 


4.65 


< .001 


G x T 


27 


2.23 


3.69 


< .001 

\ 


Error (T) 


612 


.60 






TOTAL 


719 


.91 
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Figure 7. 




Graph of means of serial analysis by trials for prc- and post- 
quostion conditions. 
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Figure 8. Graph of means of serial analysis by trials for remembering 
pre- and post- question conditions. 
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Figure 9. Graph of mans of serial dialysis by trials for inferring pre- 
^ and post-question conditions. 
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The graph of the serial analysis by trials for pre- and post- 
question conditions (Figure 7) reveals the possible beginning of the develop- 
ment of a listening skill in the posti-question condition as predicted. Suc- 
cess on trials six through ten appears to be greatef than on trials one 
through five for the post-question groups. /A closer examination of this . 
phenomenon (sec Figures 8 and 9) shows that this improvement may. be largely 
accounted for by improved scores in the post-inferring condition as opposed 

to the post- remembering condition. Given that the inferring questions seem 

\ 

to have been more difficult than the remembering questions and that the 
post-questions produced better between trials total scores, it may have 
been that the inferring questions when paired with the post- condition were 
easy enough to allow the subjects to learn how to deal with them and, thus, 
to improve their scores over trials. The pre-inferring condition may have 
been just too difficult to allow improvement of scores over trials to im- 
prove by mere exposure alone. A better test of this learning to learn 
hypothesis would be available in situations where items were of equal dif- 
ficulty across trials. Moreover, increases in the length of the listening 
task with an increased number of trials may give the learning skill a greater 
chance to develop. Increasing the number of trials and decreasing the length 
of each trial may also be a productive way to test the hypothesis. 
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CHAPTER IV 
DISCUSSION 

This chapter will present a theoretical discussion and summary 
of the results derived from the'present study and the implications these 
results have for the teaching of the mentally handicapped. 

Discussion of Results 

Hypothesis 1 of this study predicted that children receiving pre- 
questions would perform better than children who received post-questions 
on the bctwecn-trials total score (short-term retention). This was based 
on the rationale that questions could be effectively used to high-light 
information and increase short-term retention by cueing the children to 
the appropriate information to be remembered. The data did not confirm 
this hypothesis, but, in fact, showed just the opposite to be the case. 

The prediction was made in Hypothesis 2 that children would score 

>) 

higher on trials where the information necessary to answer the questions 
was in "close proximity" as opposed to "distant proximity" from the ques- 

I 

tion. The analysis of variance (Table 3) did not indicate a main effect 
due to location of information in the section and thus. Hypothesis 2 must 
be rejected as well. 

Hypothesis 3 predicted that children would perform better on trials 
where they received pre-questions and the information was close, i.e., in 
the first half ^f the section, as compared with trials where a post-ques- 

r 

tion was given and the information was close, i.e., in tho last half of the 
section. Results of the analysis of variance showed an interaction between 
question position and location of information in the section -- but in a 
direction opposite to the prediction. Results indicated that subjects did 
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better on items where they received a post-question and the information 

needed to answer the question was in the last half of the section than on 

~\ 

items where they recieved a pre-question and the information was in the 
first half of the section. 

These findings were thought to be explained best by interference 
theory. Forgetting, according to interference theory, occurs because com- 
peting responses learned before the acquisition of criterion responses 
(proactive interference) or in the retention interval (retroactive inter-, 
fcrence) somehow interfere with the- "habit" of the criterion responses. 

v 

This interference produces a decrement in criterion behavior called for- 
getting. 

In the present study the better post-question condition performance 
of subjects, as compared with subjects in the pre-question condition may 
be accounted for by the retroactive interference of the great amount of 
information presented after^the question. Thus by the time the experimenter 
, called for the response to the question in the pre-question condition, the 

subjects may well have forgotten the question. In the post -question con- 
dition, remembering the question was no problem. This explanation makes 

I 

/ .« 

the assumption that the meaningfulness and interest of the st'dry made 'it 

' more easily remembered than the relatively sterile and weak presentation 

> 

of the question. 

Interference theory may also be used in explaining the interaction 
results where the subjects performed better on the items which were post- 
questions and where the information needed to answer the question was in 
the last half of the section, than on any other combination of the two 
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variables. In looking at Figuro 1 (p. 42), the graph of this interaction, there 
seems to be a clear relationship between subjects' performance and the op- 
portunity for interference to occur. Thus, subjects do best on items where 
there is the least opportunity for interference ; the worst where there is 
the greatest opportunity for interference; and their performance is at a 
point in between these two groups whon the opportunity for interference 
is at a point half way between these two extremes (i.e., on the distant 
proximity items where the question and critical information are separated 
by half of the list oning passage) . 

Most of the research used to support interference theory is based 
upon the verbatim recall and recognition of unrelated words and rionsonso 
syllables. Ho wever, a recent review of literature in this area (Cunningham, 
1972) concludes that those studies reviewed support the assertion that 
the verbatim recall and recognition of prose follows the same laws of verbal 
interference established for the verbatim recall and recognition of unre- 
lated words and nonsense syllables. It is further contended that, though 
the research findings are still equivocal, the weight of the evidence seems 
to indicate that verbatim retention and meaning retention are comparable 
and follow the same laws. 

The application of short-term memory theory (Ellis, 1963, 1970) 
also offors a possible explanation for these findings. However, the appli- 
cation of short-term and long-term memory research conclusions would in- 
volve the application of findings generated from studies which have used 
primarily artificial learning tasks of much shorter duration than the listen- 
ing task involved in this present study. The definitional problems of the 
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1 

short-term and long-term memory concepts also Imposes restrictions and 
limitations on the application of such findings to the present study. 

However, casting these reservations aside for the moment, one may 
speculate that the subjects performed best in the post-question close prox- 
t imity condition because it placed the fewest demands on them for stimulus 

organization, rehearsal strategies and secondary memory. Resoarch reported 

in the first chapter has shown mentally retarded children to be deficient 

i / x 

! | with respect to each of these variables when compared to their normal IQ 

;/ • ' 

i peers. 

I 

j In order to test which of these orientations, if either, offers the 

| L 

| best explanation for the findings, additional data must be collected. 

j 

! By controlling the intervening activity between tho presentation of the 

t ' 

i 

! question and the information needed to answer the question, or in the post- 

| question situation, by controlling the activity between the presentation 

i 

l of the information first, followed by the question, one may be able to 

i 

! determine the relative effects of forgetting due to interference of com- 

| peting information as compared with the effects of time alone. 

| The multiple regression analyses provide further evidence for the 

j importance of better understanding the interaction of question position 

| and location of information. The regression analysis on the total between 

i I 

I trials score (Table 6, p. 49) shows mental age and question type to account 

| for approximately the same amount of variance -- 12%. However, this analysis 

. j 

j does not consider the location of critical information in the section of 

the story. If this variable is considered the relative importance of question 
position and mental age change when predicting the distant proximity scores, 
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i.e., where the critical information and question are separated by half 
of the section, mental age accounts for over 11% of the variance with ques- 
tion position accounting for scarcely more than 1.5%. However, when the 
question and information are in close proximity to each other, the effect 
of question position is vastly increased — accounting now for over 20% 
of the variance with mental age continuing to contrihute approximately 11%. 

The general conclusion to be drawn from these analyses, then, seems to bo 
that as the proximity between the question and the critical information 
increases , the more important becomes the time and activity occuring between 
the time of learning and the opportunity to give the response. (The reader 
should koep in mind that the post-question information close proximity 
group performed the best on the short-term retention test.) 

In considering the dolayed-criterion dependent measures, the following 
hypotheses were not supported by the data: Hypothesis 6 - that children 

who received questions would score higher on the relevant criterion tost 
which asked the samo questions as were in the training, than children who 
were in the control group and did not receive training questions; Hypothesis 7 
- that children who receivod pre-questions would perform better on the relevant 
criterion test than children who received post-questions; and Hypothesis 9 - 
that children would perform better on the relevant subtest than on the in- 
cidental subtest. 

These findings suggest that any differences which may have been 
obtained on the short-term retention measure wore lost by the time the 
delayed criterion tests were given 15 minutes later. Several explanations 
may be offered for theso results. 
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I 

First, the lack of any differences between the groups which received 
training questions and the control groups on the relevant and incidental 
criterion tests may be accounted for by the nature of the instructions given 
to the control group. Those children who participated as control subjects 
were strongly encouraged to listen very carefully to the story and told 
that’they would be asked questions about the story some time after it was 
over. While this would have provided the strongest test for the effect of 
questioning, it, combined with the obvious effects of participating in a 
study (Hawthorne effect), may have served to sufficiently boost the scores 
of the control group to make it appear as if the training questions had 
little long term effect. Efforts to eliminate this effect must be made in 
future attempts at creating control groups for studies of this kind. More- 
over, it should not be concluded that the control group represented a reason 
able facsimile to the "real classroom" and that questions therefore make no 
difference. The fact that each child received individual instructions to 
pay close attention to a story which was intended to have high interest 
value and was read by an enthusiastic professional story teller, is a clear 
departure from the everyday routine of the classroom. 

A second possible explanation, and one which would include the lack 
of any signficant difference between the relevant and incidental criterion 
scores, is that the children may have been quite tired after the 15 minute 
rest activity plus the 15-20 minutes it took to get through the story and 
questions. This would have served to produce a poorer performance than 
might otherwise have been expected on a task of shorter duration. 
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A third possible explanation may be that the length of each section 
of the story, reduced the effectiveness of the training questions and thus 
reduced the power of the treatment overall. Studies which shortened the 
length of the sections of the information to be learned would be likely 
to increase the number of between trials items the subject got correct. 

This would then increase the likelihood of his remembering the correct 
response to the relevant items through a straight-forward stimulus-response 
association. 

The lack of a significant interaction effect between criterion typo 
and question type in the analysis of variance of incidental delayed criterion 
test scores also forced the rejection of two additional hypotheses: Hypoth- 

esis 12 - that children who received remembering training questions would 
score higher on the remembering incidental criterion test than children 
who had received inferring training questions; and Hypothesis 13 - that 
children who received inferring training questions would perform bettor on 
the inferring incidental criterion test than children who had received re- 
membering training questions. 

The analysis of variance on the relevant delayed criterion tost 
scores (Table 8) showed a significant interaction effect between question 
type and criterion typo and thus provided confirmation of Hypothesis 10 — 
that children who received remembering training questions would score 
higher on the remembering relevant criterion test than children who had 
received inferring training questions and Hypothesis 11 - that children who 
received inferring training questions would perform better on the inferring 
relevant criterion test than children who had received rememboring training 



questions. The analysis of variance on the total criterion score (Table 12) 
produced a similar result and, thus, gave partial confirmation to Hypothesis 
12 - that the total score criterion test results would be highest for children 



lationship held only for the case whore the inferring training question 
group had done better on the inferring criterion test than the remembering 
training question group on the same test; those who received remembering 
training questions did not perform significantly better than thoso who had 
received inferring training questions on the remembering criterion test.) 



These results were taken as support for the general thesis that the 
activities a student engages in when confronted withi instructional tasks 

I 

are of critical importance in determining what he learns. The alternative 
view of course would be that the student is a passive receptacle whose learn- 
ing and performance are directly determined by input variables. The reader 
interested in this debate is directed to Anderson's 1970 article on "Control 
of student mediating processes during verbal learning and instruction." 

In general, then, it can be said that different types of questions 
may only be useful and necessary to the extent to which they facilitate the 
"desired" kind of learning. If one wanted children to perform well on a 
remembering criterion test, then inferring training questions would obviously 
be inappropriate. And, if one wanted a child to do well on an inferring 
criterion test, then remembering questions would be out of place as part of 
the training. It is not completely clear, however, whether this effect is 
a function of the facilitating effect of the appropriate type question 
relative to the criterion task, or due to the suppressing effect of the in- 



receiving* the same type of criterion test as training question. 
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appropriate type question. A close inspection of the means (Table 7) in- 
dicates that the inappropriate type question group performed slightly worse 
than the control group in each case. It should also be recalled that the 
relevant group did not perform significantly better than the control group 
on the delayed criterion test. Thus, the issue of the relative facilitating 
effects of the similar type training questions and criterion questions 
versus the possible suppressing effect of the dissimilar type training 
and criterion question remains clouded. Further research will be needed 
to unravel this important and perplexing issue. 

Nevertheless, this significant interaction effect suggests what 
Rosenshinc and Furst (1971) have called the "cognitive process" opportunity- 
to- learn phenomenon. This phenomenon refers to the important consideration 
of whether or not the level of the criterion instrument was relevant to, 
and of the same type as, the instruction. Overall, Rosenshine and Furst 
(1971) report, the correlations between measures of opportunity to learn 
and student achievement have been positive, significant, and consistent. 

This claim gained additional support with the results of a recent study 
(Watts and Anderson, 1971) which demonstrated that high school students 
did best on the criterion test when the questions they received were of 
the same type as those in the test. 

The results of the analysis of variance of the incidental delayed 
criterion test scores (p. 76) indicated that there were no significant dif- 
ferences with respect to the question position - criterion type interaction, 
and, thus, led to the rejection of Hypothesis IS - that children who re- 
ceived post -training questions would perform better than children who roceived 
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pre-training questions on the incidental criterion test where the items 
in tho criterion test were of the same type as the training questions. 

This finding is not surprising given the clear lack of power and influence 
of the question position variable after the IS minute delay. (Regression 
analysis (Table 14, p. 72) showed the question position variable to account 

for less than 1% of performance.) One must conclude from this evidence 

; . j 

thut the position of the training questions was not effective in ’facilitating 
the development of those mathemagenic behaviors necessary for the develop- 
ment of listening skills or learning sets. The fact that the development 
of listening skills was not supported by the delayed criterion test data, 
however, does not rule out the possibility that these learning sets were 
i developed during the listening task and lacked the strength to persist over 

i the period of tho IS minute interval. 

The results of the serial analysis by trials were able to shed 
some light on this possibility. General confirmation of this possibility 
cannot be claimed. Though tho fact that children who receive i pre-questions 
did not show improvement over trials (and thus led to the acceptance of 
Hypothesis 5), the results did not confirm the general improvement of chil- 
j dron who received post -questions over trials, and thus led to the rejection 

of Hypothesis 4. The graph of this interaction, however, indicates the 

f . 

possible beginnings of the development of a learning sot in the inferring 

i 

post-quostion condition (see Figure 9, p. 82). The relatively better per- 

J 

formance of the post-inferring questions as compared with the pre-inferring 
| questions during the last half of the listening task and tho better perform- 

[ ance of the subjects on the last half of the post-inferring questions sug- 

r 

l 

> 
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gests the beginnings of a learning to learn phenomenon. Research which 
increased the number of trials during a shorter period of time or increased 
the number of trials and lengthened the time of the task might shed further 
light on the development of these phenomena. Inasmuch as other research 
(e.g., Kaufman, 1971) suggests that the development of learning sets may 
be enhanced by overlearning, increased amount of training in tho post- 
question condition may provide evidence for the development of listening 
skills through skillful questioning. One possibility for testing this 
hypothesis would be to have multiple training sessions over a period of 
days followed by the criterion test. 

Implications for Teaching the Mentally Handicapped 

Research reported in the review of tho relevant literature has dem- 
onstrated that teachers ask many questions during classroom instructional 
activities. While the results of the present study cannot bo used to support 
or reject the validity of this type of teacher behavior in total, thoy can 
be useful in suggesting that studies which record merely the frequency 
of questioning in the classroom seem to be missing essential ingredients 
in productive educational practice. 

One of the results of this study suggests that the temporal prox- 
imity of a question in relation to the information to be loarned is an essen- 
tial relationship in the teaching of EMR children. The suggestion to teachers 
of the mentally handicapped, then, must be that their questioning practices 
be related to the information which is being presented in such a way that 
small units of information are quickly followed up by questions. If tho 
interval between the presentation of the information and the question 
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is too great or, if too much information is presontcd at once, the short- 
term effectiveness of the questioning will be soverely reduced. Moreover, 
the findings suggest that questioning of this nature will be more effective 
when it follows the presentation of information as opposed to when it pre- 
cedes it. 

\ 

Teacher questioning effectiveness may take many forms. One of 

i 

these is to facilitate the retention of information which a teacher might 
want students to remember. A second may be to draw attention to critical 
attributes in concept formation tasks. A third purpose for asking ques- 
tions may be to give practice in different types of thinking. 

Often it is assumed that retarded children are not capable of "higher 
lovels" of information processing. The present study has not only demon- 
strated this not to be true, but has, in fact, shown that questions can 
be used to stimulate children to think in different ways and that questions 
can be used to produce different types of learning. This may be considered 
support for the general notion that the activities students engage in when 
confronted with instructional tasks are of crucial importance in determining 
what they will learn and further, that pupils are thus active agents in 
their own learning. Retarded children appear to be no different in this 
respect. However, in teaching the retarded, more attention must be given 
to the management of these student activities as they tend not to process 
information as effectively as normal children, or engage in appropriate 
self-control over their own study skill behavior. Questions can be useful 
educational tools to these ends. 

Results of this study also suggest an important relationship between 
the type of question asked during instruction and tho type of criterion 
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performance the teacher desires of the students. The type of question 
one asks seems to be important only to the extent that it is consistent 
with the typo of learning dosirod. Desired student achievement would thus 
sown to bo increased by teachers' being able to clearly state their ins true* 
tional goals in terms of "cognitive process typos" as well as in terms of content. 

The findings also soem to have implications for the frequency of ap- 
propriate questioning of individual students; and especially of EMK students. 
Significant relationships have been demonstrated between the opportunity a 
child has to respond to questions and school achievement (Van. Wagenen and 
Travers, 1963; Travers et al., 1964). Yet several studies show that children 
perceived as slower by their teachers tend to be slighted in classroom inter- 
actions by having fewer questions directed to them (e.g,, Brophy and Good, 

1970; Lynch and Ames* 1971). If timely questioning is important for short- 
term retention (as this study has shown it to be), then it seems logical 
to conclude that children who are perceived as slow learners may not learn 
as much as they possibly could in classroom situations where they do not 
receive frequent questioning. This further supports the importance of small 
teacher-student ratios in the teaching of mentally retarded children as well 
as the need for frequent teacher-student verbal interaction. 

The general conclusion of this study must be that questions can 
play an important role in the teaching of mentally handicapped children if 
the right kind of question is asked at the right time. The right type of 
question must bo defined in terms of the desired learning outcomes and the 
right time is in close proximity and after the information to be learned 
has been presented. 
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CHAPTER V 
SUMMARY 

Tills study investigated the effects of question type and position 
on four types of lcamingamong mentally handicapped children. 

Sixty years of descriptive research had demonstrated that questions 
played an important role in the daily instructional activities of teachors. 
Further findings of these studies indicated that approximately two-thirds 
of the questions asked required direct recall of information prosonted in 
class. Studies of this type showed this percentage to be similar among 
various different levels of classrooms and subject areas. Additional evi- 
dence suggested similar percentages among special education classes. 

While educators have been concerned with the type and frequency 
of questions asked in classrooms for a long time, and more recently with 
questions in special education classes, little research had been done on 
the effects of different types of questions. 

Several studies indicated that educable mentally retarded (EMR) 
children wore capable of productive thinking. Yet other research indicated 
that low-ability children and children for whom teachers had low expectations 
missed out on opportunities for intellectual stimulation from the teacher 
and that teachers frequently called on students they expected to give the 
right answer. Still, it remained to be demonstrated that questions were 
capable of being effective stimuli in providing opportunities for this kind 
of intellectual stimulation and productive thought in general and among EMR 
children in particular. This study represented one such effort. 
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Rased on previous empirical studios concerned with question type, 
research on mathemagenic behavior (e.g., Rothkopf, 1966; Frase 1968a, 1969b), 
and relevant characteristics of educable mentally retarded children derived 
from theory and research (e.g., Ellis, 1963, 1970; Sentinel, 1965; Spitz, 

1966; Zeaman and House, 1963) fifteen different hypotheses wore formulated. 

In general, these hypotheses predicted that pre -questions would produce 
better learning than post-questions; that questions would be most offoctive 
whon the information needed to answer the question was close as opposed to 
distant; and that subjects would improve over trials. The data were all 
generated on between trials items and are referred to as the short-teijm 
retention test. 

Hypotheses concerned with the delayed criterion test predicted that: 
subjocts who received questions would perform better on the criterion test 
than control subjects who did not receive training questions; subjects who 
received the same type of criterion test question as training question would 
perform better than those subjects who received different types of training 
and criterion questions; subjects who received pre-questions would perform 
best on the relevant criterion test with subjects who received post-questions 
performing best on the incidental criterion test; and that subjects who 
received postrquestions would show the development of a listening skill. 

The subjects for this study were 90 intermediate EMR children. 

Each subject was asked to listen to a high interest-low vocabulary short 
mystery story of approximately 2400 words. This story was broken down into 
a total of ten sections. 



99 




i 



j 

t 

i 



( 




ERLC 



Subjects were randomly selected from cooperating schools and class- 
rooms and taken one at a time to the experimental room by the experimenter. 
After establishing rapport with the subject the experimenter indicated 
that they would be listening to a short story. The experimenter introduced 
tho story with a general statement describing what the story was about, 
flie subjects were tol that they would be askod some questions after they 
had finished tho story. 

During this listening activity the experimenter systematically in- 
troduced the questions. There were two types (remembering or inferring) 
in one of two positions (pre or post). In the pre-remembering question 
condition the experimenter asked a remembering question before trt'i subject 
listened to each section. As was the case in all conditions the subjects 
wore instructed to rospond to the questions orally after listening to tho 
section. The pre-inferring question group was asked one inferring question 
before each paragraph was heard. In the post- conditions the questions 
were asked one at a time, but after each paragraph was heard. There were 
a total of ton paragraphs and thus ten questions. A control group listened 
to the short story but without receiving any questions. All questions in 

both tho treatment phases and on the cirtorion test wore of tho free recall 
variety. 

The criterion test was administered IS minutes after tho experimental 
activity. This test was of two typos and consisted of a total of 20 items. A 
subject could received either 20 romombering items or 20 inferring items. Those 
20 items were brokon up into subtests of 10 items each. The relevant subtest 
consisted of items which wore the same as those received during the listening 
activity. Tho remaining 10 items were of the same type and covered material 
included in tho story though the subject had never received these items before. 
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The 15 minute rest activity between the end of the listening activity 
and the beginning of the criterion test consisted of playing with wooden 
blocks. 

The data were analyzed using analysis of variance and stepwise 
multiple regression procedures. Results of the study indicated that import- 
ant educational advantages may be derived from asking questions. On the 
short-term retention test, subjects who were asked questions in the closest 
temporal proximity to, and after the presentation of the critical inform- 
ation, were found to perform best. In addition to this, post-questions 
were found to produce better learning than pre-questions. While no statistic- 
ally significant evidence indicated subject improvement over trials a trend 
in that direction was indicated for subjects who had received post-inferring 
training questions. The findings also indicated that approximately 12$ 
of the variance on the short-term retention tost (total score) was accounted 
for by the quostion position variable. ■ 

Results on the dolayed criterion test indicated an interaction 
betwoen training question typo and critorion type with the best performance 
generally being obtained when tho training questions and items on the cri- 
| terion tost were of the same type. No statistical evidence was found which 

supported the development of learning sets or listening skills. The results 
showed the question position variable to account for approximately 1% of 
tho total variance on the delayed criterion test, thus indicating its dimin- 
ishing effects over time. 

Consistent differences were obtained with respect to the quostion 

t. • 

type variable with subjects who recoivcd remembering items outperforming 
| those who roceived inferring items. . 
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Thus, support for all hypotheses was not indicated by the data anal- 
j ysis. Interference and memory theory and research were used to explain 

I 

the findings. 

i Implications of the results for education were discussed with the 

| 

general conclusion being that questions seem to offer great valuo for the 
teaching of the mentally handicapped when they are asked at the right time 
j and with the right objective in mind. The right time was defined as being 

i * 

| in close temporal proximity to the presentation of information and after 

j 

| the presentation of this information. Further, it was noted that the ques- 

i 

tions were of greatest value when they were of the . same type as the type 
of question asked on the criterion test. 

i 
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APPENDIX A 
THE STORY 

1. Leroy Clark watched Mr. Pumkin leave the house. Mr. Pumkin had 
rented a room from Leroy's paronts a week before, but none of them had 
learned very much about him. 

Every morning at 7 o'clock Mr. Pumkin went off to work. Leroy 
alwaiys knew what Mr. Pumkin would be wearing. His shoes were so shiny 
that you could see your face in them. He woro a blue suit and a grey 
hat. Mr. Pumkin always wore the same thing. 

No one knew where Mr. Pumkin worked. He would only say that he 
worked with money. He was such a well-dressed man that it looked as if 
ho could buy his own house. Everyone thought it was strange that such 
a well-dressed man as Mr. Pumkin rented a room in Leroy's part of town. 

Leroy watched as Mr. Pumkin went down the front steps to the big, 
black car parked in the street. No one in Leroy's neighborhood had ever 
owned such a big, new car. 

Mr. Pumkin stopped beside his car. Leroy saw him reach far down 
in his pocket and take out his keys. And, at the same time, he saw some- 
thing small and yellow fall out of Mr. Pumkin's pocket. 

2. But Mr. Pumkin didn't seem to know that he had dropped something. 
He opened the car door, got in, started the engine and waved good-bye 
through the open window. 

Leroy hurried from the window to the front door. "Mr. Pumkin'; 

Mr. Pumkin l" he shouted. But Mr. Pumkin couldn't hear him. 

Leroy ran to the street and looked down at the small piece of 
paper at his feet. When he picked it up, he saw that it was a large piece 
of paper that had been folded three times. 

He looked down the street. Mr. Pumkin's car had gone away. He 
knew he shouldn't unfold the paper, even though he wanted to know what 
was inside. Mr.. Pumkin had not given him permission to look at it. Leroy 
thought about taking the paper to Mr. Pumkin's room on the third floor or 
leaving the note at the foot of the stairs. But then Mr. Pumkin might 
not see it, or Leroy's mother might throw it away when she cleaned the 
house . 

j All at once, Leroy unfolded the paper. He knew it was wrong, but 
he had to know what was inside. He looked down at the paper, surprised 
by what he saw. It wasn't a letter at all. It was a map. 

3. Leroy looked at the map for a long time. He followed each line 
with his finger, starting with the word "house." Next came "chimney," 
followed by "yard," "post," and "tree." There were five words altogether. 
Leroy wondered what it all meant. 

He had read stories about buried treasure. Could Mr. Pumkin have 
a treasure buried? 
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